


NATURAL PHILOSOPHY 

OF 

CAUSE AND CHANCE 




NATURAL PHILOSOPHY 

OF 

CAUSE AND CHANCE 


BY 

MAX BORN 


BEING 

THE WAYNFLETE LECTURES 

DELIVERED IN THE COLLEGE OF 


ST. MARY MAGDALEN, OXFORD 
IN HILARY TERM 
1948 


OXFORD 

AT THE CLARENDON PRESS 
1949 




Oxford University Press, Amen House , London E.G. 4 

GLASGOW NEW YORK TORONTO MELBOURNE WELLINGTON 
BOMBAY CALCUTTA MADRAS OAPE TOWN 

Geoffrey Oumberlege , Publisher to the University 


PRINTED IN GREAT BRITAIN 



PREFACE 


A dbaft of these lectures, written before they were delivered, 
contained considerably more technicalities and mathematics than 
the present text. Facing a large audience in which physicists 
and mathematicians were presumably a minority, I had to 
change my plans and to improvise a simplified presentation. 
Though this did not seem difficult on the platform of the Hall 
of Magdalen College, Oxford, the final formulation for publi- 
cation was not an easy task. I did not like replacing rigorous 
mathematical reasoning by that mixture of literary style, 
authority, and mystery which is often used by popularizing and 
philosophizing scientists. Thus, the idea occurred to me to 
preserve the mathematics by removing it to a detailed appendix 
which could also contain references to the literature. The vast 
extension of the latter, however, compelled me to restrict quota- 
tions to recent publications which are not in the text-books. 
Some of these supplements contain unpublished investigations 
of my school, mainly by my collaborator Dr. H. S. Green. In the 
text itself I have given up the original division into seven lec- 
tures and replaced it by a more natural arrangement into ten 
chapters. 

I have to thank Dr. Green for his untiring help in reading, 
criticizing, and correcting my script, working out drafts of the 
appendix, and reading proofs. I am also indebted to Mr. Lewis 
Elton not only for proof-reading but for carefully preparing the 
index. I have further to thank Albert Einstein for permission 
to publish sections of two of his letters. 

My most sincere gratitude is due to the President and the 
Fellows of Magdalen College who gave me the opportunity to 
plan these lectures, and the leisure to write them down for 
publication. 

I wish to thank the Oxford University Press for the excellent 
printing and their willingness to follow all my wishes. 


M. B. 
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NOTATION 


The practice of representing vector quantities by means of clarendon 
type in print is now well established, and used throughout these lectures. 
For dealing with cartesian tensors, the notation of Chapman and Milne, 
explained in the first chapter of Chapman and Cowling’s book The 
Mathematical Theory of Non-Uniform Oases (C.TT.P., 1939) is used; this 
consists in printing tensors in sans serif type. 

The follow ing examples will suffice to show how vector and tensor 
equations are translated into coordinate notation. 


a = b — > a k = bfc {h = 1, 2, 3) 

8 

ab =^a k b k 

a = b — > Ojy = bjg (j fc, l = 1, 2, 3) 

8 

a.b = c — > 2 a kibi — c k (h = 1, 2, 3) 

3 

a.b.c 

aAb = c-*a a 6 8 — a*b % = c l9 etc. 

~ == grad a = b -► ^ = (grada) fc = b k {k = 1, 2, 3) 
ox oa jfc 

±.a = diva = j>g* 
fc -1 

8 

^.a = diva = = (diva) fc = ft* (*=1,2,3) 

i-i ' 

^Aa= curia = b -> ^ = (curia)! = 6 l( etc. 
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INTRODUCTION 

The notions of cause and chance which I propose to deal with 
in these lectures are not specifically physical concepts but have 
a much wider meaning and application. They are used, more or 
less vaguely, in everyday life. They appear, not only in all 
branches of science, but also in history, psychology, philosophy, 
and theology; everywhere with a different shade of m e aning . 
It would be far beyond my abilities to give an account of all 
these usages, or to attempt an analysis of the exact significance 
of the words c cause ’ and 4 chance ’ in each of them. However, 
it is obvious that there must be a common feature in the use of 
these notions, like the theme in a set of variations. Indeed, 
cause expresses the idea of necessity in the relation of events, 
while chance means just the opposite, complete randomness. 
Nature, as well as human affairs, seems to be subject to both 
necessity and accident. Yet even accident is not completely 
arbitrary, for there are laws of chance, formulated in the mathe- 
matical theory of probability, nor can the cause-effect relation 
be used for predicting the future with certainty, as this would 
require a complete knowledge of the relevant circumstances, 
present, past, or both together, which is not available. There 
seems to be a hopeless tangle of ideas. In fact, if you look 
through the literature on this problem you will find no satisfac- 
tory solution, no general agreement. Only in physics has a 
systematic attempt been made to use the notions of cause and 
chance in a way free from contradictions. Physicists form their 
notions through the interpretation of experiments. This method 
may rightly be called Natural Philosophy, a word still used for 
physics at the Scottish universities. In this sense I shall attempt 
to investigate the concepts of cause and chance in these lectures. 
My material will be taken mainly from physics, but I shall try 
to regard it with the attitude of the philosopher, and I hope that 
the results obtained will be of use wherever the concepts of 
cause and chance are applied. I know that such an attempt will 
not find favour with some philosophers, who maintain that 
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science teaches only a narrow aspect of the world, and one which 
is of no great importance to man’s mind. It is true that many 
scientists are not philosophically minded and have hitherto 
shown much skill and ingenuity but little wisdom . I need hardly 
to enlarge on this subject. The practical applications of science 
have given us the means of a fuller and richer life, but also the 
means of destruction and devastation on a vast scale. Wise 
men would have considered the consequences of their activities 
before starting on them; scientists have failed to do so, and only 
recently have they become conscious of their responsibilities to 
society. They have gained prestige as men of aotion, but they 
have lost credit as philosophers. Yet history shows that science 
has played a leading part in the development of human thought. 
It has not only supplied raw material to philosophy by gathering 
facts, but also evolved the fundamental concepts on how to deal 
with them. It suffices to mention the Copemioan system of the 
universe, and the Newtonian dynamics which sprang from it. 
These originated the conceptions of space, time, matter, force, 
and motion for a long time to come, and had a mighty influence 
on many philosophical systems. It has been said that the meta- 
physics of any period is the offspring of the physics of the pre- 
ceding period. If this is true, it puts us physicists under the 
obligation to explain our ideas in a not-too-technical language. 
This is the purpose of the following lectures on a restricted 
hough important field. I have made an attempt to avoid 
mathematics entirely, but failed. It would have meant an un- 
bearable clumsiness of expression and loss of clarity. A way out 

2S fJT Z ? ereducti0 * of ^ter mathematics to 

efementary methods in Euclidean style-following the cele- 
brated example of Newton’s Principia. But this would even 
Ckm8illeSS and de ^royed what there is of 

'rX* “ “ m ° re than 200 y ea rs 

SSiUtoSS' t some progress in the assimilation of 
nmthmatics by those who are interested in natural philosophy 

'TV* * * suitable 

SSiH a" Sf proofs of tWein8 (the r « 
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In this way I hop© to explain how physics may throw some 
light on a problem which is not only important for abstract 
knowledge but also for the behaviour of man. An unrestricted 
belief in causality leads necessarily to the idea that the world is 
an automaton of which we ourselves are only little cog-wheels. 
This means materialistic determinism. It resembles very much 
that religious determinism accepted by different creeds, where 
the actions of men are believed to be determined from the 
beginning by a ruling of God. I cannot enlarge on the difficulties 
to which this idea leads if considered from the standpoint of 
ethical responsibility. The notion of divine predestination 
clashes with the notion of free will, in the same way as the 
assumption of an endless chain of natural causes. On the other 
hand, an unrestricted belief in chance is impossible, as it c ann ot 
be denied that there are a great many regularities in the world ; 
hence there can be, at most, ‘regulated accident’. One has to 
postulate laws of chance which assume the appearance of laws 
of nature or laws for human behaviour. Such a philosophy 
would give ample space for free will, or even for the willed 
actions of gods and demons. In fact, all primitive polytheistic 
religions seem to be based on such a conception of nature : things 
happening in a haphazard way, except where some spirit inter- 
feres with a purpose. We reject to-day this demonological 
philosophy, but admit chance into the realm of exact science. 
Our philosophy is dualistic in this respect ; nature is ruled by 
laws of cause and laws of chance in a certain mixture. How is 
this possible ? Are there no logical contradictions ? Can this 
mixture of ideas be cast into a consistent system in which all 
phenomena can be adequately described or explained ? What 
do we mean by such an explanation if the feature of chance is 
involved ? What are the irreducible or metaphysical principles 
involved ? Is there any room in this system for free will or for 
the interference of deity ? These and many other questions can 
be asked. I shall try to answer some of them from the stand- 
point of the physicist, others from my philosophical convictions 
which are not much more than common sense improved by 
sporadic reading. The statement, frequently made, that modem 
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physios has given, up causality is entirely unfounded. Modem 
physios, it is true, has given up or modified many traditional 
ideas ; but it would cease to be a science if it had given up the 
search for the causes of phenomena. I found it necessary, 
therefore, to formulate the different aspects of the fundamental 
notions by giving definitions of terms which seem to me in agree- 
ment with ordinary language. With the help of these concepts, 
I shall survey the development of physical thought, dwelling 
here and there on special points of interest, and I shall try to 
apply the results to philosophy in general. 



II 

CAUSALITY AND DETERMINISM 

The concept of causality is closely linked with that of determin- 
ism, yet they seem to me not identical. Moreover, causality is 
used with several different shades of meaning. I shall try to dis- 
entangle these notions and eventually sum them up in definitions. 

The cause-effect relation is used mainly in two ways ; I shall 
illustrate this by giving examples, partly from ordinary life, 
partly from science. Take these statements : 

Overpopulation is the cause of India’s poverty. ’ 

The stability of British politics is caused by the institution 
of monarchy.’ 

Wars are caused by the economic conditions.’ 

‘There is no life on the moon beoause of the lack of an 
atmosphere containing oxygen.’ 

Chemical reactions are caused by the affinity of molecules.’ 

The common feature to which I wish to draw your attention 
is the fact that these sentences state timeless relations. They 
say that one thing or one situation A causes another B, moaning 
apparently that the existence of B depends on A, or that if A 
were changed or absent, B would also be changed or absent. 
Compare these statements with the following: 

‘The Indian famine of 1946 was caused by a bad harvest.’ 
The fall of Hitler was caused by the defeat of his armies.’ 
‘The American war of secession was caused by the economic 
situation of the slave states.’ 

‘Life could develop on earth because of tho formation of an 
atmosphere containing oxygen.’ 

‘The destruction of Hiroshima was oaused by tho explosion 
of an atomic bomb.’ 

In these sentences one definite event A is regarded as the 
cause of another B ; both events are more or less fixed in space 
and time. I think that these two different shades of the causo- 
effect relation are both perfectly legitimate. The common factor 
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is the idea of dependence, which needs some comment. This 
concept of dependence is clear enough if the two things con- 
nected are concepts themselves, things of the mind, like two 
numbers or two sets of numbers ; then dependence means what 
the mathematician expresses by the word ‘function’. This 
logical dependence needs no further analysis (I even t.Tiinir it 
cannot be further analysed). But causality does not refer to 
logical dependence ; it means dependence of real things of nature 
on one another. The problem of what this means is not simple 
at aJl. Astrologers claim the dependence of the fate of human 
beings on the constellations of stars. Scientists reject such state- 
ments— but why ? Because science accepts only relations of 
dependence if they can be verified by observation and experi- 
ment, and we are convinced that astrology has not stood this 
test. Soience insists on a criterion for dependence, namely 
repetitive observation or experiment : either the things A and B 
refer to phenomena, occurring repeatedly in Nature and being 
sufficiently similar for the aspect in question to be considered 
as identical; or repetition can be artificially produced by 
experiment. 

Observation and experiment are crafts which are systemati- 
cally taught. Sometimes, by a genius, they are raised to the 
level of an art. There are rules to be observed: isolation of the 
system considered, restriction of the variable factors, varying 
of the conditions until the dependence of the effect on a single 
factor beoomes evident ; in many cases exact measurements and 
comparison of figures are essential. The technique of handling 
these figures is a craft in itself, in which the notions of chance 
and probability play a decisive part— we shall return to this 
questionatalater stage. Soitlooks as if science has a methodical 
way of finding causal relations without referring to any meta- 
physical principle. But this is a deception. For no observation 
or experiment, however extended, can give more than a finite 
number of repetitions, and the statement of a law — B depends on 
A always transcends experience. Yet this kind of state- 
ment is made everywhere and all the time, and sometimes from 
scanty material. Philosophers call it Inference by Induction, 
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nd have developed many a profound theory of it. I shall not 
nter into a discussion of these speculations. But I have to 
lake it clear why I distinguish this principle of induction from 
ausality. Induction allows oneto generalize a number of observa- 
ions into a general rule : that night follows day and day follows 
ight, or that in spring the trees grow green leaves, are induo- 
ions, but they contain no causal relation, no statement of 
ependence. The method of inductive thinking is more general 
aan causal thinking ; it is used in everyday life as a matter of 
aurse, and it applies in science to the descriptive and experi- 
lental branches as well. But while everyday life has no definite 
nterion for the validity of an induction and relies more or less 
a intuition, science has worked out a code, or rule of craft, for 
s application. This code has been entirely successful, and I 
link that is the only justification for it— just as the rules of the 
raft of classical music are only justified by full audienoes and 
pplause. Science and art are not so different as they appear, 
he laws in the realms of truth and beauty are laid down by the 
tasters, who create eternal works. 

Absolute values are ideals never reached. Yet I t.hinlr that 
le common effort of mankind has approached some ideals in 
aite a respectable way. I do not hesitate to call a man foolish 
he rejects the teaching of experience because no logical proof 
forthcoming, or because he does not know or does not accept 
to rules of the scientific craft. You find such super-logical 
sople sporadically among pure mathematicians, theologians, 
id philosophers, while there are besides vast communities of 
jople ignorant of or rejecting the rules of science, among them 
ie members of anti-vaccination societies and believers in 
itrology. It is useless to argue with them; I cannot compel 
iem to accept the same criteria of valid induction in which I 
»lieve: the code of scientific rules. For there is no logical 
gument for doing so ; it is a question of faith. In this sense I 
u willing to call induction a metaphysical principle, namely 
mething beyond physics. 

After this excursion, let us return to causality and its two 
ays of application, one as a timeless relation of dependence, 
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the other as a dependence of one event fixed in time and space 
on another (see Appendix, 1). I think that the abstract, timeless 
meaning of causality is the fundamental one. This becomes 
quite evident if one tries to use the term in connexion with a 
specific case without implicit reference to the abstraction. For 
example : The statement that a bad harvest was the cause of the 
Jndtan famine makes sense only if one has in mind the timeless 
statement that bad harvests are causes of famines in general. 
I leave it to you to confirm this with the other examples I have 
given or with any more you may invent. If you drop this refer- 
ence to a general rule, the connexion between two consecutive 
events loses its character of causality, though it may still retain 
the featere of perfect regularity, as in the sequence of day and 
night. Another example !s the time-table of a railway line. You 

r/rtt ^ h6lp the arrival at King’s Gross of the 
10o clockfrom Waverley; but you can hardly say that the time- 

table reveak a cause for this event. In other words, the law 

ZnnVTl S W d6tenninistic Y °* can predict future events 
from it, but the question ‘why?’ makes no sense. 

Therefore, I think one should not identify causality and 

nr^r 1 ";, 1 referS t0 "*• *** allow one to 

eve^ Wl6dge ° f an 6Vent A the occurrence of an 

nW i 1,™ V6rSa)> but ™ thout the idea that there is a 
physical timeless (and spaceless) link between aU things of the 

kind A and all things of the kind B. I prefer to use the ex- 

STctr ‘rr ality, . ma ^ f0r ^ timeless dependence. It is 
2 if. 6Xp i enmentalist8 “d observers mean when they 
traw a certam phenomenon to a certain cause by systematic 

~t,1^ dition, • The oth,t w-*- - * stst 

Mkwng one another is, however, in so common nse 
Tharrf ““I=ogg«tthatitshonldhe 
< S? fc r ntod ““ 'ottribntes- concerning 

tnne and space. Itaalwaysassnmedtliatthecanseprecedeethf 
rfb^Ipr^osetomdlthistheprincipieof antecedent 
^^^^odsampngnanttoassumeathingtocnsess; 

ortow hichitoannotbe 
y er gs , I shall call this the principle of contiguity. 
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I shall now try to condense these considerations in a few 
definitions. 

Determinism postulates that events at different times are 
connected by laws in such a way that predictions of un- 
known situations (past or future) can be made. 

By this formulation religious predestination is excluded, since 
it assumes that the book of destiny is only open to God. 

Causality postulates that there are laws by which the occur- 
rence of an entity B of a certain class depends on the 
occurrence of an entity A of another class, where the word 
‘entity 5 means any physical object, phenomenon, situation, 
or event. A is called the cause, B the effect. 

If causality refers to single events, the following attributes 
of causality must be considered : 

Antecedence postulates that the cause must be prior to, or at 
least simultaneous with, the effect. 

Contiguity postulates that cause and effect must be in spatial 
contact or connected by a chain of intermediate things in 
contact. 
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ILLUSTRATION: ASTRONOMY AND PARTICLE 
MECHANICS 

actually made his discoveries nor do T ^ * f®** m&Q 

- 

NewtoxdlntWy^fcelttialmot- astronom r- 

of a mathematical nT1 ri ,g . . ° tl0ns 18 an excellent example 

This holds S5?5S deterministic, yet not causal, description. 
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oftheplouetahvlr' Ptolemy represented the motion 
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I do not trieh Tm S replMed 4116 <*">“ * «%*«• 

the standpoint of the „„ ^erse. I just consider it from 

I^Wnrc^. , ^ m J ta0kTO *“ di8OT ^- Neither 

haviour of the planets Except the rdtinT & Cause for the be ' 
Creator. What they do is in mod tUQate ° ause > 1116 ^ of the 
the establishment of functions, mathe “atical language, 

- .. * 1==/l{<) ’ **=/*«), ..., 

sssssrss Si^“ gonw “ 

the corresponding geoEa^rioafs^•r^ S ^ 1Il0 ^ 10nS, ormore ^urately 
than those of Ptofemv J” 6 ®’ 816 ™ry : much simpler 

* “ P6,ii ‘ ion * 



ILLUSTRATION: ASTRONOMY AND PARTICLE MECHANICS II 
Descartes’s cosmology can be regarded as an early attempt to 
establish causal laws for the planetary orbits by assuming a 
complicated vortex motion of some kind of ether, and it is 
remarkable that this construction satisfies contiguity. But it 
failed because it lacked the main feature of scientific progress; 
it was not based on a reasonable induction from facts. Of course, 
no code of rules existed, nor did Descartes’s writings provide it 
at that time. The principles of the code accepted to-day are 
implicitly contained in the works of Galileo and Newton, who 
dem onstrated them with their actual discoveries in physics and 
astronomy — in the same way as Haydn established the rules of 
the sonata by writing lovely music in this form. 

Galileo’s work precodes Newton’s not only in time but also 
in logical order; for Galileo was experimenting with terrestrial 
objects according to the rules of repetition and variation of con- 
ditions, while Newton’s astronomical matorial was purely obser- 
vational and restricted. Galileo observed how a falling body 
moves, and studied the conditions on which the motion depends. 
His results can be condensed into the well-known formula for the 
vertical coordinate of a small body or ‘particle’ as a funotion of 

timo > S - -\gl\ (3.1) 

where g is a constant, i.o. independent not only of time but also 
of the falling body. The only thing this quantity g can depend 
upon is the body towards which the motion is taking place, the 
earth — a conclusion which is almost too obvious to be formu- 
lated ; for if the motion is chocked by my hand, I feel the weight 
as a pressure directed downwards towards the earth. Hence the 
constant g must be interpreted as a property of the earth, not of 
the falling body. 

IMng Nowton’s calculus (denoting the time derivative by a 
dot) and generalizing for all throe coordinates, one obtains the 
equations £ , r 0 , y « 0, z « -g, (3.2) 

which describe the trajectories of particles upon the earth with 
arbitrary initial positions and velocities. 

These formulae condense the description of an infinite number 
of orbits and motions in one single simple statement: that some 
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dentrftT^- ^ “f*? " th6 Same for the vhole class, indepen- 
t at * mdlV 5 dual therefore ^Pending only on the one 

£mel^ g nam6ly th6 6arth - H ence thL property! 

wStv\r i a 3 CC6le f ati0n> must be <due to the earth’, or 
T i. y * h ® ® arfc k > or a force exerted by the earth 

notion of** 0106 mdicates a specification of the general 
figures Aoartfrn 1U ^f J a measur able cause, expressible in 
JfSLiS!? thls ^ement, Galileo’s work is just a case 
v + aJity in the sense of my definition. 

fa m aJw (3 ' 2 lr 01 ™ t ™ e ' ain ” 11116 'oSbof of the force 
ZZ.T^, r “‘ 6 0f okm S e of velocity in time. This 

"‘* i “ “ ” aI i r “ ult of observation and measurement, and has no 

detmLSfo 'll 0 * W ? at “? Ver ' A oooooqoenoe of this fact fa the 
^ >*»««):« the position and the 
min. its ™ t P 6 " 6 S ‘ vm at “T time, the equations deter- 

«me .to pomton and velocity at any other time 
to&ct, any other tone in tie past or future. This show, that 

a efaT ““'fonntothe postulate of antecedence: 

latoTtn^Sf n “ 0™>0t be regarded a, the cause of a 
Jer ntna ton, because the relation between them fa completely 

symmetrical; each determines the other Th™ ™ J 

~n y with°rSn.r^zr n *■*—*■—*» 

motionB. The material" on 

which he based lug fi fi 
^ 0nly 8ix Pj^ets (inXdTglhe 
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induction to any planet they are, according to Newton, equiva- 
lent to the statement that the acceleration is always directed 
towards the sun and varies inversely as the square of the distance 
r from the sun, /xr~ 2 , where fi is a constant which may differ 
from planet to planet. But it is the third law which reveals the 
causal relation to the sun. It says that the ratio of the square 
of the period and the cube of the principal axis is the same for 
all planets — induced from data about the six known planets. 
This implies, as Newton showed (see Appendix, 2), that the 
constant \l is the same for all planets. Hence as in Galileo’s case, 
it can depend only on the single other body involved, the sun. 
In this way the interpretation is obtained that the centripetal 
acceleration ft? — 2 is ‘due to the sun’, or ‘caused by the sun’, or 
‘a force exerted by the sun’. 

The moon and the other planetary satellites were then the 
material for the induction which led to the generalization of a 
mutual attraction of all bodies towards one another. The most 
amazing step, rightly admired by Newton’s contemporaries and 
later generations, was the inclusion of terrestrial bodies in the 
law derived from the heavens. This is in fact the idea symbolized 
by the apocryphal story of the falling apple : terrestrial gravity 
was regarded by Newton as identical with celestial attraction. 
By applying his laws of motion to the system earth-moon, he 
could calculate Galileo’s constant of gravity g from geodetical 
and astronomical data: namely, 


_ 47T 2 i? 3 
^ r 2 ^ 2 ’ 


(3.3) 


where r is the radius of the earth, R the distance between the 
centres of earth and moon, and T the time of revolution of 
the moon (sidereal month). 

The general equations for the motion of n particles under 
mutual gravitation read in modern vector notation 

f a = -grad a F, 


(3.4) 
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where r a is the position vector of the particle a (a = 1, 2 ,...,»), 
r a p = |r a — r^| the distance of two particles a and /J, V the 
potential of the gravitational forces. 

Newton also succeeded in generalizing the laws of motion for 
other non-gravitational forces by introducing the notion of mass, 
or more precisely of inertial mass. Newton’s method of repre- 
senting his results in an axiomatic form does not reveal the way 
he obtained them. It is, however, possible to regard this step as 
a case of ordinary causality derived by induction. One has to 
observe the acceleration of different particles produced by the 
same non-gravitational (say elastio) forces at the same point of 
space; they are found to differ, but not in direction, only in 
magnitude. Therefore, one can infer by induction the existence 
of a scalar faotor characteristic for the resistance of a particle 
against acceleration or its inertia. This factor is called ‘mass’. 
It may still depend on velocity as is assumed in modem theory 
of relativity. This can be cheoked by experiment, and as in 
Newton’s time no such effect could be observed, the mass was 
regarded as a constant. 

Then the general i zsd'equations of motions read 


mj a = F a , (3.6) 

where m a (a = 1, 2 ,...,%) are the masses and F a the force vectors 
which depend on the mutual distances r a p of all the particles. 
As in the case of gravitation, they may be derivable from a 
potential V by the operation 


F a =-grad a V, (3.7) 

where V is a function of the r^. The most general form of V for 
forces inverse to the square of the distance would be ( 2' means 

a, ft 

summing over all a,j8 except a = /?) 



(3.8) 


where ^ are constants; comparison with (3.4) and (3.6) shows, 
however, that these must have the form 


Potfl = m *pp- 


(3.9) 
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Newton applies further a law of symmetry, stated axiomatio- 
•Uy } namely that action equals reaction, or = /x« a , from 
^ W follows 

MjJ = Kmp, (3.10) 

K is a universal constant, the constant of gravitation. 
' c ^nce the constants of attraction or gravitational masses fi a 
proportional to the inertial masses ra a . 

Neither Newton himself, nor many generations of physicists 
astronomers after him have paid much attention to the law 
*^pressed by (3.10). Astronomical observations left little doubt 
**at it was correct, and it was proved by terrestrial observations 
(^ith suitable pendulums) to hold with extreme accuracy 
( JSlotvos and others). Two centuries went by before Einstein saw 
fundamental problem , contained in the simple equation 
(3.10), and built on it the colossal structure of his theory of 
Several relativity, to which we have to return later. 

But this is not our concern here. We have to examine 
Newton’s equations from the standpoint of the principle of 
e ausality. I hope I have made it clear that they imply the notion 
cause exactly in the same sense as it is always used by the 
^^cperimentalist, namely signifying a verifiable dependence of 
on© thing or another. Yet this one thing is, in Galileo’s and 
^Newton’s theory alike, a peculiar quantity, namely an accelera- 
tion. The peculiarity is not only that it cannot be seen or read 
from a measuring tape, but that it contains the time implicitly. 
Iri fact, Newton’s equations determine the motion of a system 
in time completely for any given initial state (position and 
velocity of all particles involved). In this way, causation ’ leads 
“bo * determination not as a new metaphysical principle, but as a 
physical fact, like any other. However, just as in Galileo’s 
simpler case, so here the relation between two consecutive con- 
figurations of the system is mutual and symmetrical. This has a 
Id earing on the question whether the principle of antecedence 
holds. As this applies, according to our definition, only to the 
cause-effect relation between single events, one has to change 
t>h© standpoint. Instead of considering the acceleration of one 
body to be caused by the other bodies, one considers two 



16 ILLUSTRATION : ASTRONOMY AND PARTICLE MECHANICS 



it makes sense to call the earlier one the cause of the later one. 
But it makes no sense, for the relation between the two states is 
symmetrical. One could, with the same right, call the later 
configuration the cause of the earlier one. 

The root of this symmetry is Newton’s definition of time. 
Whatever he says about the notion of time (in Principia, 
Scholium I) as a unif orm flow, the use he makes of it contains 
nothing of a flow in one direction. Newton’s time is just an 
independent variable t appearing in the equations of motion, in 
suoh a way that if £ is changed into — t, the equations remain 
the same. It follows that, if all velocities are reversed, the 
system just goes back the same way ; it is completely reversible. 

Newton’s time variable t is obviously an idealization abstracted 
from simple mechanical models and astronomical observations, 
fitting well into celestial motion, but not into ordinary experi- 
ence. To us it appears that life on earth is going definitely in one 
direction, from past to future, from birth to death, and the 
perception of time in our mind is that of an irresistible and 
irreversible current. 

Another feature of Newton’s dynamics was repugnant to 
many of his contemporaries, in particular the followers of 
Descartes, whose cosmology, whatever else its shortcomings, 
satisfied the principle of contiguity, as I have called the condition 
that cause and effect should be in spatial contact. Newton’s 
forces, the quantitative expressions for causes of motion, are sup- 
posed to act through empty space, so that cause and effect are 
simultaneous whatever the distance. Newton himself refrained 
from entering into a metaphysical controversy and insisted that 
the facts led unambiguously to his results. Indeed, the language 
of facts was so strong that they silenced the philosophical objec- 
tions, and only when new facts revealed to a later generation the 
propagation of forces with finite velocity, was the problem of 
contiguity in gravitation taken up. In spite of these difficulties, 
Newton’s dynamics has served many generations of physicists 
and is useful, even indispensable, to-day. 
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CONTIGUITY 

MECHANICS OF CONTINUOUS MEDIA 

Although I maintain that neither causality itself nor its 
attributes, which I called the principles of antecedence and 
contiguity, are metaphysical, and that only the inference by 
induotion transcends experience, there is no doubt that these 
ideas have a strong power over the human mind, and we have 
evidence enough that they have influenced the development of 
classical physics. Much effort has been made to reconcile 
Newton’s laws with these postulates. Contiguity is closely bound 
up with the introduction of contact forces, pressures, tensions, 
first in ordinary material bodies, then in the electromagnetic 
ether, and thus to the idea of fields of foroes ; but the systematic 
application of contiguity to gravitation exploded Newton’s 
theory, which was superseded by Einstein’s relativity. Similar 
was the fate of the postulate of antecedence ; it is closely bound 
up with irreversibility in time, and found its first quantitative 
formulation in thermodynamics . The reconciliation of it with 
Newton’s laws was attempted by atomistics and physioal 
statistics ; the idea being that accumulations of immense num- 
bers of invisible Newtonian particles, atoms, or molecules appear 
to the observer to have the feature of irreversibility for statistical 
reasons. The atoms were first hypothetical, but soon they were 
taken seriously, and one began to search for them, with increas- 
ing success. They became more and more real, and finally even 
visible. And then it turned out that they were no Newtonian 
particles at all. Whereupon the whole classical physios exploded, 
to be replaced by quantum theory. Looked at from the point 
of view of our principles, the situation in quantum theory is 
reversed. Determinism (which is so prominent a characteristic 
of Newton’s theory) is abandoned, but contiguity and ante- 
cedence (violated by Newton’s laws) are preserved to a consider- 
able degree. Causality, which in my formulation is independent 
of antecedence and contiguity, is not affected by these changes : 

5181 q 
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scientific work will always be the search for causal interdepen- 
dence of phenomena. 

After this summary of the following discussion, let us return to 
the question why violationof the principlesof contiguity and ante- 
cedence in Newton’s theory was first accepted — though not with- 
out protest — but later amended and finally rejected. This change 
was due to the transition from celestial to terrestrial mechanics. 

The success of Newton’s theory was mainly in the field of 
planetary motion, and there it was overwhelming indeed. It is 
not my purpose to expand on the history of astronomy after 
Newton; it suffices to remember that the power of analytical 
mechanics to describe and predict accurately the observations 
led many to the conviction that it was the final formulation of 
the ultimate laws of nature. 

The main attention was paid to the mathematical investiga- 
tion of the equations of motion, and the works of Lagrange, 
Laplace, Gauss, Hamilton, and many others are a lasting 
memorial of this epoch. Of all these writings, I shall dwell only 
for a moment on that of Hamilton, because his formulation of 
Newton’s laws is the most general and elegant one, and because 
they will be used over and over again in the following lectures. 
So permit me a short mathematical interlude which has nothing 
directly to do with cause and chance. 

Hamilton considers a system of particles described by any 
(in general non-Cartesian) coordinates q^q^,...; then the poten- 
tial energy is a function of these, V(q v q 2 ,...) or shortly V(q), and 
the kinetic energy T a function of both the coordinates and the 
generalized velocities q v q 2 T(q, q). He then defines general- 

ized momenta 

(^. 1 ) 


and regards the total energy T-\- V as a function of the q a and p a . 
This function T+v = H{q p) (4 . 2 ) 

is to-day called the Hamiltonian. 

The equations of motion assume the simple ‘canonical’ form 
8H . 8H 
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from which one reads at once the conservation law of energy, 

jj . «, 0, If « const. (4.1) 

It is this set of formulae which has survived the most violent 
revolution of physical ideas which has ever taken place, the 
transition to quantum mechanics. 

Returning now to the post-Newtonian period there was, 
simultaneous with the astronomical applications and continua- 
tions of the theory, a lively interest in applying it to ordinary 
terrestrial physics. Even hen* Newton had shown the way and 
had calculated, for instance, the velocity of sound in a fluid. 
Eventually the mechanics of elastic solids brought, about a 
modilieution of Newton's definition of force which satisfies con- 
tiguity. Much of this work is due to the great mathematician 
rauehy. He started, as many before him, by treating a solid as 
an aggregate of tiny particles, acting on one another with 
Newtonian non contiguous forces of short range— anticipating 
to some degree the modern atomistic standpoint. But there was 
of course, at that time, no evidence of the physical reality of 
these particles. In the physical applications all traces of them 
were obliterat ed by averaging. The form of these results sug- 
gested to Cauchy another method of approach where particle 
mechanics is completely discarded. Matter is considered to bo a 
real continuum in the mathematical sense, so that it. has a 
meaning to speak of a. force between two pieces of matter 
separated by a surface. This seems to be, from our modern 
standpoint-, a step in the wrong direction, as wo know matter 
to be discontinuous. But Cauchy's work showed how con- 
tiguity could be introduced into mechanics; the importance of 
this point became evident when the new method was applied to 
the ether, t he carrier of light, and of electric and magnetic forces, 
which even to day is still regarded as continuous though it has 
lost, most of the characteristic properties of a substance and can 
hardly be culled a continuous mvtlium . 

In this theory nil laws appear in the form of partial differential 
equations, in which the three space coordinates appear together 
with the time ns independent coordinates. 
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I shall give a short sketch of the mechanics of continuous 
media. 

Mass, velocity, and all other properties of matter are con- 
sidered continuously distributed in space. The mass per unit 
volume or density p is then a function of the space coordinates, 
and the same holds for the current of mass u = pv (namely the 
quantity of mass passing through a surface per unit area and 
unit time). The conservation (indestructibility) of mn.ua then 
leads to the so-called continuity equation (see Appen div , 3) 

/j+div u = 0. (4.5) 

Concerning the forces, one has to assume that, if the sub- 
stance is regarded as separated into two parts by a surface, each 
part exerts a push or pull through this surface on the other which, 
measured per unit area, is called tension or stress. A simplp. 
mathematical consideration, based on the equilibrium conditions 
for the resultant forces acting on the surfaces of a volume 
element, shows that it suffices to define these tension forces for 
three non-coplanar surface elements, say those parallel to the 
three coordinate planes; the force on the element normal to x 
being T x with components T^T^T^, the other two forces 
correspondingly T y (T yx , T w , T VI ) and T s (T„, TJ. Then the 
force on a surface element with the normal uni t vector 


n (n x ,n y ,n z ) 

is given by T n = T^+T^+T,*,,. (4.6) 

Application of the law of moments to a small volume element 
shows (see Appendix, 3) that 


T =T 

VZ - L ssyi 


T == T 


T =T 

J 'xy J -yx m 


(4.7) 


Hence the quantities T form a symmetrical matrix, the stress 

7 


T = 



T. 


VZ 


T„ 


(4.8) 


Newton s law applied to a volume element then leads to the 
equations 

* dy -r 

fg-fcTT. 


(4.9) 
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where div T is a vector with the components 
(divT)* = 

'* 8x ^ 8x ^ 8x ’ 
and djdt the operator 


w V V U , v 

dt~8i +V *te +V Vto+ V ’8-z’ 


(4.10) 

(4.11) 


which is called the * convective derivative*. 

(4.9) together with (4.5) are the new equations of motion 
which satisfy the postulate of contiguity. They are the proto- 
type for all subsequent field theories. In the present form they 
are still incomplete and rather void of meaning, as the stress 
tensor is not specified in its dependence on the physical condi- 
tions of the system — just in the same way as Newton’s equations 
are void of meaning if the forces are not specified with their 
dependence on the configuration of the particles. The configura- 
tion of a continuous system cannot be described by the values 
of a finite number of variables, but by certain space functions, 
called ‘strain-components \ They are defined in this way: 
A small (mfinite&imal) volume of initially spherical shape will be 
transformed by the deformation into an ellipsoid ; the equation 
of this has the form 


e n # 2 + e 22 2/ 2 + 6 33 3 2 +2e 23 yz-\- 2e sl zx-j-2e 12 xy = c, (4.12) 
where e is an (infinitesimal) constant, measuring the absolute 
dimensions, and e lv e 22i ...,e 12 are six quantities depending on 
the position x , y, z of the centre of the sphere. These are the 
components of the strain tensor e. 

In the theory of elasticity it is assumed that the stress com- 
ponents Ty are linear functions of the strain components 
(Hooke’s law). 

In hydrodynamics the relation between T and e involves 
space- and time-derivatives of e^. In plastic solids the situation 
is still more complicated. 

We need not enter into these different branches of the 
mechanics in continuous media. The only important point for 
us is this: Contact forces spread not instantaneously but with 
finite velocity. This is the main feature distinguishing Cauchy’s 
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contiguous mechanics from Newton’s non-contiguous. The 
simplest example is an elastic fluid (liquid or gas). Here the 
stress tensor T has only diagonal elements which are equal and 
represent the pressure#. The configuration can also be described 
by one variable, the density p or, for a given mass, the volume V. 
The relation between# andF may be any funotion # =f(V ) — we 
shall have to remember this later when we have to deal with 
thermodynamics. For small disturbances of equilibrium the 
general equations reduce to linear ones ; any quantity <f> in an 
isotropic fluid (change of volume or pressure) satisfies the linear 
wave equation , , 

where A is Laplace’s differential operator 

A = divgrad = il-|-il+iL, (4.14) 

dx*^dy*^dz* y 1 

and c a constant which, is easily found to mean the phase 
velocity of a plane harmonic wave 

</> = -4sin-^ {x— ct). 

A 

The equation (4.13) l ints up mechanics with other branches 
of physics which have independently developed, optics and 
electromagnetism. 

ELECTROMAGNETIC F IELD S 

The history of optics, in particular Newton’s contributions 
and his dispute with Huygens about the corpuscular or wave 
nature of light, is so well known that I need not speak about it. 
A hundred years after Newton, the wave nature of light was 
established by Young and Fresnel with the help of experiments 
on diffraction and interference. Wave equations of the type 
(4.13) were used as a matter of course to desoribe the observa- 
tions, where now <f> means the amplitude of the vibration. 

But what is it that vibrates ? A name, ‘ether,’ was ready to 
hand, and its ability to propagate transverse waves suggested 
that it was oomparable to an elastic solid. In this way it came 
to pass that the ether-filled vacuum was the carrier of contact 
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forces, spreading with finite velocity. They existed for a long 
period peacefully beside Newton’s instantaneous forces of 
gravitation, and other similar forces introduced to describe 
elementary experiences in electricity and magnetism. These 
forces are usually connected with Coulomb’s name, who verified 
them by direct measurements of the intensity of attraction and 
repulsion between small charged bodies, and between the 
poles of needle-shaped magnets. He found a law of the same 
type as that of Newton, of the form /xr“ 2 , where the constant /x 
depends on the state of electrification or magnetization respec- 
tively of the interacting particles; by applying the law of 
action and reaction /x can be split into factors, /x = in the 
electric case, where e 1 , e 2 are called the charges. It must, 
however, have been remarked that this law was already 
established earlier and with a higher degree of accuracy by 
Cavendish and Priestley by an indirect reasoning, with the help 
of the fact that a closed conductor screens a charged particle 
from the influence of outside charges ; this argument, though still 
dressed up in the language of Newtonian forces, is already quite 
close to notions of field theory. 

It was the attempt to formulate the mechanical interactions 
between linear currents (in thin wires) in terms of Newtonian 
forces which entangled physics in the first part of the nineteenth 
century in serious difficulties. Meanwhile Paraday had begun 
his investigations unbiased by any mathematical theory, and 
accumulated direct evidence for understanding electric and 
magnetic phenomena with the help of contact forces. He spoke 
about pressures and tensions in the media surrounding charged 
bodies, using the expressions introduced in the theory of elas- 
ticity, yet with considerable and somewhat strange modifica- 
tions. Indeed, the strangeness of these assumptions made it 
diffi cult for his learned contemporaries to accept his ideas and 
to discard the well-established Newtonian fashion of descrip- 
tion. Yet seen from our modem standpoint, there is no intrinsic 
difference between the two methods, as long as only static and 
stationary phenomena are considered. Mathematical analysis 
shows that the resultant forces on observable bodies can be 
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expressed either as integrals over elementary contributions of 
the Newtonian— or better Coulombian— type acting over the 
distance, or by surface integrals of tensions derived from field 
equations. This holds not only for conductors in vacuo, but also 
for dielectric and magnetizable substances ; it is true that in the 
latter case the Coulombian forces lead to integral equations 
which axe somewhat involved, but the differential equations of 
the field axe, in spite of their simpler aspect, not intrinsically 
simpler. This is often overlooked in modem text-books. How- 
ever, in Faraday’s time this equivalence of differential and 
integral equations for the forces was not known, and if it had 
been, Faraday would not have caxed. His conviction of the 
superiority of contact forces over Coulombian forces rested on 
his physical intuition. It needed another, more mathematically- 
minded genius, Clerk Maxwell, to find the clue which made it 
impossible to accept forces acting instantaneously over finite 
distances: the finite velocity of propagation. It is not easy to 
analyse exactly the epistemological and experimental founda- 
tions of Maxwell’s prediction, as his first papers make use of 
rather weird models and the purity of his thought appears only 
in his later pubhcations. I think the process which led to Max- 
well s equations, stripped of all unnecessary verbiage andround- 
about ways was this : By combining all the known experimental 
facts about charges, magnetic poles, currents, and the forces 
etween riiem, he could establish a set of field equations con- 
necting the spatial and temporal changes of the electric and 
magnetic field strength (foroe per unit charge) with the electric 
charge density ^and current. But if these were combined with 
the comhtion that any change of charge could occur only by 
(6X P reS86d * a continuity equation analo- 
Tn l 4 ' 5 ’ 866 A J p f dlx * 4 >’ 811 Adequacy became obvious. 

Jfnguage ° f that time ’ the **ult was formulated by 
saying that no open currents (like discharge of condensers! 
Zf ^ ‘Ms theory. Therefore someZg Zj 

, , ng m equations , and an inspection showed a suspicious 
fc.toe, e look of symmetry. The terms expressing jJLC! 
notion lew (production of electric force by the time variation 
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of the magnetic field) had no counterpart obtained by exc hanging 
the symbols for electric and magnetic quantities (production of 
magnetic force by the time variation of the electric field). With- 
out any direct experimental evidence Maxwell postulated this 
inverse effect and added to his equations the corresponding 
term, which expresses that a change of the electric field (dis- 
placement current) is, in its magnetic action, equivalent to an 
ordinary current. It was a guess based on a belief in harmony. 
Yet by some mathematical reasoning it can be co nn ected with 
one single but highly significant fact which sufficed to convince 
Maxwell of the correctness of his conjeoture — just as Newton 
was convinced of the correctness of his law of gravitation by one 
single numerical coincidence, the calculation of terrestrial 
gravity from the moon’s orbit. Maxwell showed that his modified 
equations had solutions representing waves, the velocity, c, of 
which could be expressed in terms of purely electric and magnetic 
constants ; for the vacuum c turned out to be equal to the ratio 
of a unit of charge measured electrostatically (by Coulomb’s 
law) and electromagnetically (by Oersted’s law). This ratio, a 
quantity of the dimensions of a velocity, was known from 
measurements by Kohlrausch and Weber, and its numerical 
value coincided with the velocity of light. That could hardly 
be accidental, indeed, and Maxwell could pronounce the electro- 
magnetic theory of light. 

The final confirmation of Maxwell’s theory was, after his 
death, obtained by Hertz’s discovery of electromagnetic waves. 

I cannot follow the further course of events in the establish- 
ment of electromagnetic theory. I only wish to stress the point 
that the use of contact forces and field equations, i.e. the 
establishment of contiguity, in electromagnetism was the result 
of a long struggle against preconceptions of Newtonian origin. 
This confirms my view that the question of contiguity is not a 
metaphysical one, but an empirical one. 

We have now to see whether the laws of electromagnetism 
satisfy the principle of antecedence. An inspection of Maxwell’s 
equations (see Appendix, 4) shows that a reversal of time, 
leaves everything, including the continuity equation, 
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unchanged, if the electric density and field are kept unchanged 
while the electric current and magnetic field are reversed. This 
is a kind of reversibility very similar to that of mechanics, 
where a change of the sign of all velocities makes the system 
return to its initial state. The difference is only a practical one : 
a change of sign of all current densities and the whole magnetic 
field is not as simple to perform as that of a finite set of velocities. 
The situation is best seen by considering an electromagnetic 
wave spreading from a point source ; the corresponding solution 
of Maxwell's equation is given by so-called retarded potentials 
which express the electromagnetic state at a point P for the time 
t in terms of the motion of the source at the time t—rjc , where r 
is the distance of P from the source. But there also exist other 
solutions, advanced potentials, which refer to the later time 
t+r/c and represent a wave contracting towards the source. 

Such contracting waves are of course necessary for solving 
certain problems. Imagine, for instance, a spherical wave 
reflected by a concentrio spherical mirror. However, such a 
mirror must be absolutely perfect to do its duty, and there 
appears to be something improbable about the occurrence of 
advanced potentials in nature. For the description of ele- 
mentary processes of emission of atoms or electrons one has 
supplemented Maxwell's equations by the rule that only retarded 
solutions are allowed. In this way a kind of irreversibility can 
be introduced and the principle of antecedence satisfied. But 
this is altogether artificial and unsatisfactory. The irreversi- 
bility of actual electromagnetic processes has its roots in other 
facts which we shall later have to describe in detail. Maxwell's 
equations themselves do not satisfy the postulate of antecedence. 

RELATIVITY and the field theory of gravitation 

The situation which we have now reached is that which I 
found when I began to study almost half a century ago. There 
existed, more or less peacefully side by side, Newton's mechanics 
of instantaneous action over any distance, Cauchy's mechanics 
of continuous substances, and Maxwell's electrodynamics, the 
latter two satisfying the postulate of contiguity. Of these theories, 
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Maxwell’s seemed to be the most promising and fertilo, and the 
idea began to spread that possibly all forces of nature might be of 
electromagnetic origin. The problem had to be onvisagod, how to 
reconcile Newton’s gravitational forces with tho postulate of con- 
tiguity ; the solution was Einstein’s general theory of relativity. 

This is a long and interesting story by itself which involves 
not only the notion of cause with which we are concerned here, 
but other philosophical concepts, namely those concornod with 
space and time. A detailed discussion of these problems would 
lead us too far away from our subject, and I think it hardly 
necessary to dwell on them because relativity is to-day widely 
known and part of the syllabus of the student of mathomatios 
and physics as well. So I shall give a very short outline only. 

The physical problems which led to the theory of relativity 
were those concerned with the optical and electromagnetic 
phenomena of fast-moving bodies. There aro two types of 
experiments: those using the high velocity of oolestiai bodies 
(e.g. Michelson’s and Morley’s experiment) and those using fast 
electrons or ions (e.g. Bucherer’s measurement of the mass of 
electrons in cathode rays as a function of tho velocity). Tho work 
of Lorentz, FitzGerald, Poincard, and others prepared the ground 
for Einstein’s discovery that tho root of all difficulties was tho 
assumption of a universal time valid for all moving systems of 
reference. He showed that this assumption has no foundation 
in any possible experience and he replaced it by a simple defini- 
tion of relative time, valid in a given coordinate system, hut 
different from the time of another systom in relative motion. 
The formal law of transformation from ono space-time system 
to another was already known, owing to an analysis of Lorentz ; 
it is in faot an intrinsic property of Maxwell’s equations. The 
Lorentz transformation is linear; it expresses the physical 
equivalence of systems in relative motion with constant velocity 
(see Appendix, 5). 

Einstein's theory of gravitation is formally based on a general- 
ization of these transformations into arbitrary, non-linear ones; 
with the help of these one can express tho transition from one 
system of reference to another accoloratod (and simultaneously 
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deformed) one. The physical idea behind this mathematical 
formalism has been already mentioned: the exact proportion- 
ality of mass, as defined by inertia, and of mass as defined by 
gravitation, equation (3.10) ; or, in other words, the fact that 
in Newton’s law of gravitational motion (3.4) the (inertial) mass 
does not appear. 

Einstein succeeded in establishing equations for the gravita- 
tional field by identifying the components of this field with the 
quantities g^ v which define the geometry of space-time, namely 
the coefficients of the line element 

da* = 2 g llv dx> i dx v , (4.15) 

IAV 

where x 1 , x 2 , x 3 stand for the space coordinates x, y, z, x* for the 
time t. 

In ordinary 3-dimensional Euclidean geometry the are 
constant and can, by a proper choice of the coordinate system, 
be normalized in such a way that 

flW» = 1, g^v = 0 for y # v. 

Minkowski showed that special relativity can be regarded as 
a 4-dimensional geometry, where time is added as the fourth 
coordinate, but stall with constant g^ v , which can be normalized to 

011 = 022 = 083 = 044 = — 1, 0,xv = 0 for JL4 ^ v. (4.16) 

It was further known from the work of Riemann that a very 
general type of non-Euclidean geometry in 3-dimensional space 
could be obtained by taking the g^ v as variable functions of 
a: 1 ,* 2 ,* 8 , and the mathematical properties of this geometry had 
been thoroughly studied (Levi-Civita, Ricci). 

Einstein generalized Riemann’s formalism to four dimensions, 
assuming that the g^ v depend not only on x 1 , x 2 , jc 3 , but also on 
x*, the time. However, he regarded the g^ v not as given functions 
of x 1 , x 2 , 3 ?, x i but as field quantities to be calculated from the 
distribution of matter. He formed a set of quantities which 
can be regarded as a measure of the ‘curvature’ of space and are 
functions of the g^ and their first and second derivatives, and 
postulated equations of the form 


(4.17) 
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where k is a constant and the are generalizations of the 
tensions in matter, defined in (4.8): one has to supplement the 
tensor T by a fourth row and column, whepe T 14 , T u , are 
the components of the density of momentum, the density of 
energy. These equations (4.17) are invariant in a very general 
sense, namely for all continuous transformations of space-time, 
and they are essentially uniquely determined by t his property 
and the postulate that no higher derivatives than the second- 
order ones should appear. 

If the distribution of matter is given, i.e. the T^ v are known, 
the field equations (4.17) allow one to calculate the g ' , i.e. the 
geometry of space. Einstein found the solution for a mass point 
as source of the field, and by assuming that the motion of another 
particle was determined by a geodesic, or shortest, or straightest 
line in this geometry, he showed that Newton’s laws of planetary 
motion follow as a first approximation. But higher approxima- 
tions lead to small deviations, some of which can be observed. 
I cannot enter into the discussion of all the consequences of the 
new gravitational theory ; Einstein’s predictions have been con- 
firmed, although some of them are at the limit of observational 
technique. But I wish to add a remark about a theoretical point 
which is not so well known, yet very important. The assump- 
tion that the motion of a particle is given by a geodesic is 
obviously an unsatisfactory feature ; one would expect that the 
field equations alone should determine not only the field pro- 
duced by particles but also the reaction of the particles to the 
field, that is their motion. Einstein, with his collaborators 
Infeld and Hoffmann, has proved that this is in fact the case, 
and the same result has been obtained independently and, as I 
think, in a considerably simpler way, by the Russian physicist, 
Fock. On the basis of these admirable papers, one can say that 
the field theory of gravitation is logically perfect — whether it 
will stand all observational tests remains to be seen. 

From the standpoint of the philosophical problem, which is 
the subject of these lectures, there are several conclusions to be 
drawn. The first is that now physical geometry, that is, not some 
abstract mathematical system but the geometrical aspect of the 
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behaviour of actual bodies, is subject to the cause-effect relation 
and to all related principles like any other branch of science. 
The mathematicians often stress the opposite point of view; 
they speak of the geometrization of physics, but though it cannot 
be denied that the mathematical beauty of this method has 
inspired numerous valuable investigations, it seems to me an 
over-estimation of the formalism. The main point is that 
Einstein’s geometrical mechanics or mechanical geometry 
satisfies the principle of contiguity. On the other hand, ante- 
cedence, applied to two consecutive configurations as cause and 
effect, is not satisfied, or not more than in electrodynamics ; for 
there is no intrinsic direction in the flow of time contained in 
the equations. The theory is deterministic, at least in principle : 
the future or past motion of particles and the distribution of the 
gravitational field are predictable from the equations, if the 
situation at a given time is known, together with boundary 
conditions (vanishing of field at infinity) for all times. But as 
the gravitational field travels between the particles with finite 
velocity, this statement is not identical with Newtonian deter- 
minism : a knowledge is needed, not only of all particles, but also 
of all gravitational waves (which do not exist in Newton’s 
theory). Emstein himself values the deterministic feature of his 
theory very highly. He regards it as a postulate which has to 
be demanded from any physical theory, and he rejects, there- 
fore, parts of modem physics which do not satisfy it. 

Here I only wish to remark that determinism in field theories 
seems to me of very little significance. To illustrate the power 
of mechanics, Laplace invented a super-mathematician able to 
predict the future of the world provided the positions and veloci- 
ties of all particles at one moment were given to him. I can 
sympathize with him in his arduous task. But I would really 
pity him if he had not only to solve the numerous ordinary 
differential equations of Newtonian type but also the partial 
differential equations of the field theory with the particles as 
singularities. 
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We have now to discuss the experiences which make it possible 
to distinguish in an objective empirical way between past and 
future or, in our terminology, to establish the principle of 
antecedence in the chain, of cause and effect. These experiences 
are connected with the production and transfer of heat. There 
would be a long story to tell about the preliminary steps 
necessary to translate the subjective phenomena of hot and cold 
into the objective language of physics: the distinction between 
the quality ‘temperature’ and the quantity ‘heat’, together 
with the invention of the corresponding instruments, the ther- 


mometer and calorimeter. I take the technical side of this 

development to be well known and I shall use the thermal 

concepts in the usual way, although I shall have to analyse them 

presently from the standpoint of scientific methodology. It was 

only natural that the measurable quantity heat was first 

regarded as a kind of invisible substance called caloric. The 

flow of heat was treated with the methods developed for material 

liquids, yet with one important difference: the inertia of the 

caloric fluid seemed to be negligible; its flow was determined 

by a differential equation which is not of the second but of the 

first order in time. It is obtained from the continuity equation 

(see ( 4 . 5 )) * , ,. 

#+divq = 0 (5.1) 


by assuming that the ohange of the density of heat Q is propor- 
tional to the change of temperature T,8Q = c 8 T (where c is the 
specific heat), while the current of heat q is proportional to the 
negative gradient of temperature, q = — * grad T (where k is 
the coefficient of conductivity). Hence 


c 


ST 

dt 


= kLT, 


( 5 . 2 ) 


a differential equation of the first order in time. This equa- 
tion was the starting-point of one of the greatest discoveries 
in mathematics, Fourier’s theory of expansion of arbitrary 
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functions in terms of orthogonal sets of simple periodic functions, 
the prototype of numerous similar expansions and the embryo 
from which a considerable part of modem analysis and mathe- 
matical physics developed. 

But that is not the aspect from which we have here to regard 
the equation (5.2) ; it is this: 

■ The equation does not allow a change of t into —t, tho result 
cannot be compensated by a change of sign of other variables 
as happens in Maxwell’s equations. Hence the solutions exhibit 
an essential difference of past and future, a definite ‘flow of time’ 
as one is used to say — meaning, of course, a flow of ovonts in 
time. For instance, an elementary solution of (5.2) for the 
temperature distribution in a thin wire along tho sc-dircction is 

T-T 0 = 0*+#**, ( 5 . 3 ) 

which describes the spreading and levelling out of an initially 
high temperature concentrated near the point x 0, an 
obviously irreversible phenomenon. 

I do not know enough of the history of physics to understand 
how this theory of heat conduction was reconciled with tho 
general conviction that the ultimate laws of physics were of tho 
Newtonian reversible type. 

Before a solution of this problem could be attempted another 
important step was necessary: the discovery of tho equivalence 
of heat and mechanical work, or, as we say to-day, of tho first 
law of thermodynamics. It is important to remember that this 
discovery was made considerably later than the invention of tho 
steam-engine. Not only the production of heat by mechanical 
work (e.g. through friction), but also the production of work 
from heat (steam-engine) was known. The new feature was the 
statement that a given amount of heat always corresponds to a 
definite amount of mechanical work, its ‘mechanical equivalent’. 
Robert Mayer pronounced this law on very scanty and indirect 
evidence, but obtained a fairly good value for the equivalent 
from known properties of gases, namely from the difference of 
heat necessary to raise the temperature by one degree if either 
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the volume is kept constant or the gas allowed to do work 
against a constant pressure. Joule investigated the same 
problem by systematic experiments which proved the eggenfi n] 
point, namely that the work necessary to transfer a system from 
one equilibrium state to another depends only on these two 
states, not on the process of application of the work. This is the 
real content of the first law ; the determination of the numerical 
value of the mechanical equivalent, so much stressed in text- 
books, is a matter of physical technique. To get our notions 
clear, we have now to return to the logical and philosophical 
foundations of the theory of heat. 

The problem is to transform the subjective sense impressions 
of hot and cold into objective measurable statements. The latter 
are, of course, again somewhere connected with sense impres- 
sions. You cannot read an instrument without looking at it. 
But there is » difference between this looking at, say, a thermo- 
meter with which a nurse measures the temperature of a patient 
and the feeling of being hot under which the patient suffers. 

It is a general principle of science to rid itself as much as 
possible from sense qualities. This is often misunderstood as 
meaning elimination of sense impressions, which, of course, is 
absurd. Science is based on observation, hence on the use of the 
senses. The problem is to eliminate the subjective features and 
maintain only statements which can be confirmed by several 
individuals in an objective way. It is impossible to explain to 
anybody what I mean by saying ‘This thing is red’ or ‘This 
■thing is hot’. The most I can do is to find out whether other 
persons call the same things red or hot. Science aims at a closer 
relation between word and fact. Its method consists in firwW 
correlations of one kind of. subjective sense impressions with 
other kinds, using the one as indicators for the other, and in this 
■way establishes what is called a fact of observation. 

Here I have ventured again into metaphysics. At least, a 
philosopher would claim that a thorough study of these 
methodological principles is beyond physics. I think it is again 
a rule of our craft as scientists, like the principle of inductive 
infersncs, and I shall not analyse it further at this moment. 

5131 _ 
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In the case of thermal phenomena, the problem is to define 
the quantities involved — temperature, heat — by means of 
observable objective changes in material bodies. It turns out 
that the concepts of mechanics , configuration and force, strain 
and stress, suffice for this purpose, but that the laws of mechanics 
have to be essentially changed. 

Let us consider for simplicity only systems of fluids, that is of 
continuous media, whose state in equilibrium is defined by one 
single strain quantity, the density, instead of which we can also, 
for a given mass, take the total volume V. There is also only one 
stress quantity, the pressure p. From the standpoint of 
mechanics the pressure in equilibrium is a given function of the 
volume, p =/(F). 

Now all those experiences which are connected with the 
subjective impression of making the fluid hotter or colder, show 
that this law of mechanics is wrong : the pressure can be changed 
at constant volume — namely ‘by heating’ or ‘by cooling’. 

Hence the pressure p can be regarded as an independent 
variable besides the volume V, and this is exactly what thermo- 
dynamics does. 

The generalization for more complicated substances (such as 
those with rigidity or magnetic polarizability) is so obvious that 
I shall stick to the examples of fluids, characterized by two 
thermodynamically independent variables V 9 p. But it is 
necessary to consider systems consisting of several fluids, and 
therefore one has to say a word about different kinds of contact 
between them. 

To shorten the expression, one introduces the idea of ‘walls’ 
separating different fluids. These walls are supposed to be so 
thin that they play no other part in the physical behaviour of 
the system than to define the interaction between two neigh- 
bouring fluids. We shall assume every wall to be impenetrable 
to matter, although in theoretical chemistry semi-permeable 
partitions are used with great advantage. Two kinds of walls 
are to be considered. 

Ail adiabatic wall is defined by the property that equilibrium 
body enclosed by it is not disturbed by any external process 
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as long as no part of the wall is moved (distance forces being 
excluded in the whole consideration). 

Two comments have to be made. The first is that the 
adiabatic property is here defined without using the notion of 
heat , that is essential, for as it is our aim to define the thermal 
concepts in mechanical terms, we cannot use them in the ele- 
mentary definitions. The second remark is that adiabatic 
enclosure of a system can be practically realized, as in the Dewar 
vessel or thermos flask, with a high degree of approximation. 
Without this fact, thermodynamics would be utterly im- 
practicable. J 

The ordinary presentations of this subject, though rather 
careless in their definitions, cannot avoid the assumption of the 
possibility of isolating a system thermally; without this no 
calorimeter would work and heat could not be measured 
The second type of wall is the diathermanous wall, defined by 
the following property: if two bodies are separated by a diather 

SS° va Tu 7 T ^ “ 6quilibrium for arbitrary values of 

betwep eS f Pl ’ 1 and P2> but only a def mite relation 
between these four quantities is satisfied 

F (Pv Vi,Z> 2 , V 2 ) = 0. (5,4) 

d JSSi 1 the ' TFlZ! ni ° f] fchemal1 c <>”tact; the wallis onlyintro- 
duced to symbolize the impossibility of exchange of material. 

two f ratUr ° iS baS<3d ° n the ex J>^ence that 

b d , 5 be ! ng Jn dermal equilibrium with a thud one, are 

the alT * h ® rmal e< l uilibri um with another. If we write (5.4) in 

F{1 ’ 2) = °> tiis Property of equilibrium can be 
expressed by saymg that of the three equations 

m3) = 0, mi) = 0, -F(l, 2) = 0, (5.5) 

miy two aiways involve the third. This is only possible if 
(5.4) can be brought into the form 1 

fi(Pi>r 1 )=A(2> 2 ,r 2 ). (66) 

J?Z 2 ° rr T ° f tt6 two '"’*»• "V 2 - “ ttamometo 
and mtroduce the value of the function 

f 2 (P 2 , V 2 ) = # 


(5.7) 
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as empirical temperature* Then on© has for the other body the 

so-called equation of state 

A(fc.J5)~*. ( 6 -8) 


Any arbitrary function of # can be chosen as empirical tem- 
perature with equal right; the choice is restricted only by 
practical considerations. (It would be impractical to use a ther- 
mometric substance for which two distinguishable states are in 
thermal equilibrium.) The curves # = const, in the p F-plane are 
independent of the temperature scale ; they are called isotherms . 

It is not superfluous to stress the extreme arbitrariness of the 
temperature scale. Any suitable property of any substance can 
be chosen as thermometric indicator, and if this is done, still 
the scale remains at our disposal. If we, for example, choose a 
gas at low pressures, because of the simplicity of the isothermal 
compression law pV = const., there is no reason to take p V = # 
as measure of temperature: one could just as well take (pV ) 2 or 
*J(pV). The definition of an ‘absolute’ scale of temperature was 
therefore an urgent problem which was solved by the discovery 
of the second law of thermodynamics. 

The second fundamental concept of thermodynamics, that of 
heat, can be defined in terms of mechanical quantities by a 
proper interpretation of Joule’s experiments. As I have pointed 
out already, the gist of these experiments lies in the following 
fact: If a body in an adiabatic enclosure is brought from one 
(equikbrium) state to another by applying external work, the 
amount of this work is always the same in whatever form 
(mechanical, electrical, etc.) and manner (slow or fast, etc.) it is 
applied. 


Hence for a given initial state (p 0 , V 0 ) the work done adia- 
baticafiy is a function U of the final state (p, F), and one can 

W=V-U„; (5.9) 

the function U(jp, V ) is called the energy of the system. It is a 
quantity directly measurable by mechanical methods. 

. . Kwe now insider a non-adiabatic process leading from the 
mitial state (p 0 , PJ) to the final state (p, V), the difference 
~~ ^ no * zero, but can be determined if the energy 
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function U(p 9 V) is known from previous experiment. This 
difference jj_u o -W = Q (5.10) 

is called the heat supplied to the system during the process. 
Equation (5.10) is the definition of heat in terms of mechanical 
quantities. 

This procedure presupposes that mechanical work is measur- 
able however it is applied ; that means, for example, that the 
displacements of and the forces on the surface of a stirring- 
wheel in a fluid, or the current and resistance of a wire heating 
the fluid, must be registered even for the most violent reactions. 
Practically this is difficult, and one uses either stationary pro- 
cesses of a comparatively long duration where the irregular 
initial and final stages can be neglected (this includes heating 
by a stationary current), or extremely slow, ‘quasi-static’ 
processes ; these are in general (practically) reversible, since no 
kinetic energy is produced which could be irreversibly destroyed 
by friction. In ordinary thermodynamics one regards every 
curve in the p F-plane as the diagram of a reversible process ; 
that means that one allows infinitely slow heating or cooling by 
bringing the system into thermal contact with a series of large 
heat reservoirs which differ by small amounts of temperature. 
Such an assumption is artificial; it does not even remotely 
correspond to a real experiment. It is also quite superfluous. 
We can restrict ourselves to adiabatic quasi-static procosscs, 
consisting of slow movements of the (adiabatic) walls. For those 
the work done on a simple fluid is 

dW = —pdV, (5.11) 

where p is the equilibrium pressure, and the first theorem of 
thermodynamics (5. 10) assumes the form 

dQ = dU+pdV = 0. (5.12) 

For systems of fluids separated by adiabatic or diathormanous 
walls the energy and the work done are additive (according to 
our definition of the walls) ; hence, for instance, 

dQ = dQi+dQz = dU +p 1 dV 1 +p. i dV i , 

u = u t +u t . 


where 


( 5 . 13 ) 
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solution. This was about forty years ago, but still all text- 
books reproduce the ‘classical’ method, and I am almost certain 
that the same holds for the great majority of lectures — I know, 
however, a few exceptions, namely those of the late K. H. Fowler 
and his school. This state of affairs seems to me one of unhealthy 
conservatism. I take in these lectures an opportunity to advo- 
cate a change. 

The central point of Carath6odory’s method is this. The 
principles from which Kelvin and Clausius derived the second 
law are formulated in such a way as to teovor the greatest 
possible range of processes inoapable of execution: in no way 
whatever can heat be completely transformed into work or 
raised to a higher level of temperature. Caratheodory remarked 
that it is perfectly sufficient to know the existence of mum t 
impossible processes to derive the second law. I need hardly 
say that this is a logical advantage. Moreover, the impossible 
processes are already obtained by scrutinizing Joule’s experi- 
ments a little more carefully. They consisted in bringing a 
system in an adiabatic enclosure from ono equilibrium state to 
another by doing external work : it is an elementary experience, 
almost obvious, that you cannot get your work back by reversing 
the process. And that holds however near the two states are. 
One can therefore say that there exist adiabatically inaccessible 
states in any vicinity of a given state. That is (Jarathtfodory’s 
principle. 

In particular, thore arc neighbour states of any given 
one which are inaccessible by quasi-static adiabatic processes. 
These are represented by adiabatic lines satisfying the 
Pfaffian equation (5.14). Therefore the question arises: Does 
CaratbAodory’s postulate hold for any Pfaffian or does it mean 
a restriction ? 

The latter is the case, and it can be seen by very simple 
mathematics indeed, of which I shall give hero a short sketch 
(see Appendix, 7). 

Let us first consider a Pfaffian equation of two variables, 
x and y, 

dQ = X dx--Y dy. 


(5.15) 



40 AOTECEDENCE: THERMODYNAMICS 

where X,7 are fimctions of x,y. This is equivalent to the 
ordinary differential equation 


dy_ _X 
dx 7’ 


(5.16) 


whioh has an infinite number of solutions <f>(x,y) = const., 
representing a one-parameter set of curves in the (x, y)-plane! 
Along any of these curves one has 


d<f,= te dx +i£ d y ==0 ’ ( 5 . 17 ) 

and this must be the same condition as the given Pfaffian; 
hence one must have , „ . , , 

dQ = Ad<f>. (5.18) 

Each Pfaffian dQ of two variables has therefore an ‘integrating 
denominator’ A, so that dQ / A is a total differential. 

Eor Pfaffians of three (or more) variables, 


dQ = Xdx+ 7dy+Zdz (5.19) 

this doe® not hold. It is easy to give analytical examples (see 

//fnTj’Z ’ but 0116 08,11 see ** geometrically in this way : if in 
(5.19) da :, dy, da are regarded as finite differences $-x, v -y t~z 
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oannot be connected by such a line. This is quite elementary. 
Not quite so obvious is the inverse statement which is used in the 
thermodynamic application: If there are points P in any 
vicinity of a given point P 0 which cannot be connected with P 0 
by a line satisfying the Pfaffian equation (5.19), then there 
exists an integrating denominator and one has dQ — Xd<f>. 

One can intuitively understand this theorem by a continuity 
consideration : All points P inaccessible from P 0 will fill a certain 
volume, bound by a surface of accessible points going through P 0 . 
Further, to each inaccessible point there corresponds another 
one in the opposite direction ; hence the boundary surface must 
contain all accessible points : which proves the existence of the 
function <f>, so that dQ = Xd<f> (see Appendix, 7). 

The application of this theorem to thermodynamics is now 
simple. Combining it with Carath^odory’s principle, one has for 
any two systems 

dQi = X x dfa, dQ% — X z d<f> z , (5.20) 

and for the combined system 

dQ = dQ x +dQ z = Xd<f> ; (5.21) 

hence Xd<f> = X 1 dfa+X i dfa. (6.22) 

Consider in particular two simple fluids in thermal contact; 
then the system has three independent variables V v !£,$, which 
oan be replaced by fa, fa,&. Then (5.22) shows that depends 
only on fa, fa, and not on &, while 


8<j> X 1 8<f> Ag 

0^ _ T’ Ufa = J‘ 

Hence these quotients are also independent of &, 


d_\ 

8& A 


0 , 



from whioh one infers 


(6.23) 


A, d& A 1 ' 

Now A x is a variable of the first fluid only, therefore only 



42 ANTECEDENCE : THERMODYNAMICS 

dependent on 4>x and in the same way Aj = A 2 (^ 2j &). The 
first equality (5.24) can only hold if both quantities depend only 
on Hence 


StogAj _ aiogA a SlogA 

d& W~ a&~ “ 


(5.25) 


where g(&) is a universal function, namely numerically identical 
for different fluids and for the combined system. 

This simple consideration leads with ordinary mathematics 
to the existence of a universal function of temperature. The rest 
is just a matter of normalization. From (5.25) one finds for each 
system 

logA = J g(&) d#-flog<I>, A = 0e /«<*) d& , (5.26) 

where 0 depends on the corresponding <f>. 

If one now defines 


T(&) = Cel o(8) 


- J J j 


where the constant (7 can be fixed by prescribing the value of 
for ^ reproducible states of some normal substance 
(e.g. 21— T 2 = 100°, if 21 corresponds to the boiling-point, 21 
the freezing-point of water at 1 atmosphere of pressure), then 
one has 

dQ = \d<f> = TdS. (5.28) 

2 1 is the thermodynamical or absolute temperature and S the 
entropy. 

. ® quation < 6 ' 28 ) refers only to quasi-static processes, that 
is, to sequences of equilibrium states. To get a result about 
real dynamical phenomena one has to apply Caratheodory’s 

P + Tvi 6 T?f™ C ° nsidering a finite transition from an initial 
state VF a , S° to a final state V lt 7 2 , 8. One can reach the latter 
one m two steps: first changing the volume quasi-statically 
(and admbatically) from FJ, Fg to F X ,F 2 , the entropy remaining 
constant, equal to &, and then changing the state adiabatically! 

u irreversi y (by stirring, etc.) at constant volume, so that 
» goes over into S. 

How if any neighbouring value 8 of S» could be reached in 
way, one would have a contradiction to Carathdodory’s 



ANTECEDENCE: THERMODYNAMICS 43 

principle, as the volumes are of course arbitrarily changeable. 
Hence for each such process one must have either 8 ^ S° or 
S < S». Continuity demands that the same sign holds for all 
initial states; it holds also for different substances since the 
entropy is additive (as can be easily seen). The actual sign > 
or < depends on the choice of the constant C in (5.27); if this is 
chosen so that T is positive, a single experience, say with a gas, 
shows that entropy never decreases. 

It may not be superfluous to add a remark on the behaviour 
of entropy for the case of conduction of heat. As thermo- 
dynamics has to do only with processes where the initial and 
final states are equilibria, stationary flow cannot be treated: one 
can only ask, What is the final state of two initially separated 
bodies brought into thermal contact ? The difficulty is that a 
change of entropy is only defined by quasi-static adiabatic 
processes ; the sudden change of thermal isolation into contact, 
however, is discontinuous and the processes inside the system 
not controllable. Yet one can reduce this process to the one 
considered before. By quasi-static adiabatic changes of volume 
the temperatures can be made equal without change of entropy ; 
then contact can be made without discontinuity, and the initial 
volumes quasi-statically restored, again without a change of 
entropy. The situation is now the same as in the initial state 
considered before, and it follows that any process leading to the 
final state must increase the entropy. 

The whole chain of considerations can be generalized for more 
complicated systems without any difficulty. One has only to 
assume that all independent variables except one are of the type 
represented by the volume, namely arbitrarily changeable. 

If one has to deal, as in chemistry, with substances which are 
mixtures of different components, one can regard the concentra- 
tions of these as arbitrarily variable with the help of somi- 
permeable walls and movable pistons (see Appendix, 8). 

By using thermodynamics a vast amount of knowledge has 
been accumulated not only in physics but in the borderland 
sciences of physico-chemistry, metallurgy, mineralogy, etc. Most 
of it refers to equilibria. In fact, the expression ‘ thermodynamics 9 
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is misleading. The only dynamical statements possible are 
concerned ■with the irreversible transitions from one equilibrium 
state to another, and they are of a very modest character, giving 
the total increase of entropy or the decrease of free energy 
F = U—TS. The irreversible process itself is outside the scope 
of thermodynamics. 

The principle of antecedence is now satisfied ; but this gain is 
paid for by the loss of all details of description which ordinary 
dynamics of continuous media supplies. 

Can this not be mended ? Why not apply the methods of 
Cauchy to thermal processes, by treating each volume element 
as a small thermodynamical system, and regarding not only 
strain, stress, and energy, but also temperature and entropy as 
continuous functions in spaoe ? This has of course been done, 
but with limited success. The reason is that thermodynamics is 
definitely connected with walls or enclosures. We have used the 
adiabatic and diathermanous variety, and mentioned semi- 
permeable walls necessary for chemical separations; but a 
volume element is not surrounded by a wall, it is in free contact 
with its neighbourhood. The thermodynamic change to winch 
it is subject depends therefore on the flux of energy and material 
constituents through its boundary, whioh themselves cannot be 
reduced to mechanics. In some limiting cases, one has found 
simple solutions. For instance, when calculating the velocity of 
sound in a gas, one tried first for the relation between pressure 
p and density p the isothermal law p — cp where c is a constant, 
but found no agreement with experiment ; then one took the 
adiabatio law p = cpv where y is the ratio of the specific boats at 
constant pressure and constant volume (see Appendix, 9), whioh 
gave a much better result. The reason is that for fast vibrations 
there is no time for heat to flow through the boundary of a 
volume element which therefore behaves as if it were adiabati- 
oally enclosed. But by making the vibrations slower and slower, 
one certainly gets into a region whore this assumption does not 
hold any more. Then conduction of heat must be taken into 
account. The hydrodynamioal equations and those of heat con- 
duction have to be regarded as a simultaneous system. In this 
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way a descriptive or phenomenological theory oan be developed 
and has been developed. Yet I am unable to give an account of 
it, as I have never studied it; nor have the majority of physicists 
shown much interest in this kind of thing. One knows that any 
flux of matter and energy can be fitted into Cauchy’s general 
scheme, and there is not much interest in doing it in the most 
general way. Besides, each effect needs separate constants — 
e.g. in liquids compressibility, specific heat, conductivity of heat, 
constants of diffusion ; in solids elastic constants and parameters 
describing plastic flow, etc., and very often these so-called con- 
stants turn out to be not constants, but to depend on other 
quantities (see Appendix, 10). 

Therefore one can rightly say that with ordinary thermo- 
dynamics the descriptive method of physics has come to its 
natural end. Something new had to appear. 
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KINETIC THEORY OF GASES 

The new turn in physios was the introduction of atomistics and 
statistics. 

To Mow up the histoiy of atomistics into the remote past is 
not in the plan of this lecture. We can take it for granted that 
since the days of Demokritos the hypothesis of matter being 
composed of ultimate and indivisible particles was familiar to 
eve^- educated man. It was revived when the time was ripe. 
Lord Kelvin quotes frequently a Father Boscovich as one of the 
hrst to use atomistic considerations to solve physical problems ; 
e lived in the eighteenth century, and there may have been 

ThTfiraf W ^° m f- W n0tMng) thinkin S on tho same lines. 
T t6 “ a ^ C use of at omisties was made in chemistry, 
* allowed the reduction of innumerable substances to a 
lativeiy small stockof dements. Physics followed considerably 

“ SUCh Waa of 110 ff-t use without 

mo^fundamentahdea.nameiythattheobservableproperties 

of matter are not intrinsic qualities of its smallest parts but 

SL°r ttst g ° Teni8d * >■» - £ 2 * 

of probabtly itaeif, ezpreases the*, W k 

Wan exp^e i w 110 ^^ 011 “ the backbono * 
cance of this method in an ^ pinion tho signiii- 

code of rul* wS t Z "Tt * ** 6stabM ™-t of a 
the curious ntu^n ^ ^ 7 1 "*™* Now 

the possibility of scientific W • °° 6 rules, which ensures 
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relation, contains besides many other prescriptions those related 
to observational errors, a branch of the theory of probability. 
This shows that the conception of chance enters into the very 
first steps of scientific activity, in virtue of the fact that no 
observation is absolutely correct. I think chance is a more 
fundamental conception than causality ; for whether in a con- 
crete case a cause-effect relation holds or not can only be judged 
by applying the laws of chance to the observations. 

The history of science reveals a strong tendency to forget this. 
When a scientific theory is firmly established and confirmed, it 
ohanges its character and becomes a part of the metaphysical 
background of the age : a doctrine is transformed into a dogma. 
In fact no scientific doctrine has more than a probability value 
and is open to modification in the light of new experience. 

After this general remark, let us return to the question how 
the notion of chance and probability entered physics itself. 

As early as 1738 Daniel Bernoulli suggested the interpretation 
of gas pressure as the effect of the impact of numerous particles 
on the wall of the container. The actual development of the 
kinetic theory of gases was, however, accomplished much later, 
in the nineteenth century. 

The object of the theory was to explain the mechanical and 
thermodynamical properties of gas from the average behaviour 
of the molecules. For this purpose a statistical hypothesis was 
made, often called the ‘principle of molecular chaos’: for an 
‘ideal’ gas in a closed vessel and in absence of external forces all 
positions and all directions of velocity of the molecules are 
equally probable. 

Applied to a monatomic gas (the atoms are supposed to be 
mass points), this leads at once to a relation between vol um e V, 
pressure p, and mean energy U (see Appendix, 11) 

Vp = \U , (6.1) 

if the pressure p is interpreted as the total momentum trans- 
ferred to the wall by the impact of the molecules. One has now 
only to assume that the energy U is a measure of temperature 
to obtain Boyle’s law of the isotherms. Then it follows from 
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thermodynamics that V is proportional to the absolute tempera- 
ture (see Appendix, 9) ; one has 

U = IET, 7 = RT, (6.2) 

where B is the ordinary gas oonstant. This is the complete 
equation of state (combined Boyle-Charles law), and one sees 
that the specific heat of a monatomic gas for constant volume 
is|B. 

I have mentioned these things only to stress the point that the 
kinetic theory right from the beginning produced verifiable 
numerical results in abundance. There could be no doubt that 
it was right, but what did it really mean ? 

How is it possible that probability considerations can be 
superimposed upon the deterministic laws of mechanics without 
a dash? 

These laws connect the state at a time t to the initial state, 
at time by definite equations. They involve, however, no 
restriction on the initial state. This has to be deter min ed by 
observation in every concrete case. But observations are not 
absolutely accurate; the results of measurements will suffer 
scattering according to Gauss’s rules of experimental errors. In 
the case of gas molecules, the situation is extreme ; for owing 
to the smallness and excessive number of the molecules, there is 
almost perfect ignorance of the initial state. 

The only facts known are the geometrical restriction of the 
position of each molecule by the walls of the vessel, and some 
physical quantities of a crude nature, like the resultant pressure 
and the total energy: very little indeed in view of the number of 
molecules (about 10 19 per c.o.). 

Henoe it is legitimate to apply probability considerations to 
the initial state, for instance the hypothesis of molecular chaos. 
The statistical behaviour of any future state is then completely 
determined by the laws of mechanics. This is in particular the 
case for statistical equihbrium ’, when the observable properties 
axe independent of time ; in this case any later state must have 
the same statistical properties as the initial state (e.g. it must 
also satisfy the condition of molecular chaos). How can this he 
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mathematically formulated ? It is convenient to use the equa- 
tions of motion in Hamilton’s canonical form (4.3, p. 18). The 
distribution is described by a function/^, q l9 q 2 9 - 9 q n9 pi 9 P 2 9 —>Pn) 
of all coordinates and momenta, and of time, such that fdpdq 
is the probability for finding the system at time £ in a given 
element dpdq = dp 1 ...dp n dq 1 ...dq n . One can interpret this 
function as the density of a fluid in a 2n-dimensional pq- space, 
called ‘ phase space’ ; and, as no particles are supposed to dis- 
appear or to be generated, this fluid must satisfy a continuity 
equation, of the kind (4.5, p. 20), generalized for 2 n dimensions, 
namely (see Appendix, 3) 


3/ , y Wfr) , _ Q 

dq k ^ 8p k ) 


(6.3) 


This reduces in virtue of the canonical equations (4.3), p. 18, 

= 0, (6.4) 


where [. H,f ] is an abbreviation, the so-called Poisson bracket, 


namely 




3?* 8 Pk tyk 3? J 


On the other hand, the convective derivative defined for three 
dimensions in (4.11, p. 21) may be generalized for 2n dimensions 
thus: . 


#_3/, W0/ . df \ 

dt~St+ l[Wu ,,+ W*H 


Then (6.4) says that in virtue of the mechanical equations 


The result expressed by the equivalent equations (6.4) and 
(6.7) is called Liouville’s theorem. The density function is an 
integral of the canonical equations, i.e. / = const, along any 
trajectory in phase space; in other words, the substance of the 
fluid is carried along by the motion in phase space, so that the 
integral j- _ J/jp*, (6.8) 

E 
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over any part of the substance moving in phase space is inde- 
pendent of time. 

Any admissible distribution function, namely one for which 
the probabilities of a configuration at different times are com- 
patible with the deterministic laws of mechanics, must be an 
integral of motion, satisfying the partial differential equation 
(6.4). For a closed system, i.e. one which is free from external 
disturbances (like a gas in a solid vessel), H is explicitly inde- 
pendent of time. The special oase of statistical equilibrium 
corresponds to certain time-independent solutions of (6.4), i.e. 


functions / satisfying 


[#,/] = 0 . 


(6.9) 


An obvious integral of this equation is / = <D(H), where <£> indi- 
cates an arbitrary function. This case plays a prominent part 
in statistical mechanics. 


Yet before continuing with these very general considerations 
we had better return to the ideal gases and consider the kinetic 
theory in more detail. In an ideal gas, the particles (atoms, 
molecules) are supposed to move independently of one another. 
Hence the function f(p, q) is a product of N functions /(x,^, t) 
each belonging to a single particle and all formally identical; 
x is the position vector and % — ( 1 jm) p the velocity vector. Then 
fdxdS, is the probability of finding a particle at time t at a specified 
element of volume and velocity. 

In the case where no external forces are present ( 8H/8t 0, 

dHjdx — 0) the Hamiltonian reduces to the kinetic energy, 


H — fymJE 2 = (l/2m)p 2 . 


The hypothesis of molecular chaos is expressed by assuming/ to 
be a function of Z 2 alone. This is indeed a solution of (6.9), as it 
can be written in the form / = 0(H) mentioned above. No 
other solution exists if the gas as a whole is homogeneous and 
isotropic (i.e. all positions and directions arc physically equiva- 
lent; see Appendix, 12). 

The determination of the velocity distribution function /(§*) 
was recognized by Maxwell as a fundamental problem of kinetic 
theory: it is the quantitative formulation of the ‘law of chance’ 
for this case. He gave several solutions; his first and simplest 
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reasoning was this: Suppose the three components of velocity 
to be statistically independent, then 

M 2 ) = M +€ !+«) = rtfiwawa). (6.10) 

This functional equation has the only solution (see Appendix, 1 3) 
4> = e“~«\ 

/== ^ 6 ‘ 11 ^ 
where a, jS are constants. 

This is Maxwell’s celebrated law of velocity distribution. 
However, the derivation given is objectionable, as the supposed 
independence of the velocity components is not obvious at all. 
I have mentioned it because the latest proof (and as I think the 
most satisfactory and rigorous and of the widest possible genera- 
lization) of the distribution formula uses exactly this Maxwellian 
argument, only applied to more suitable variables — as we shall 
presently see. 

Maxwell, being aware of this weakness, gave several other 
proofs which have been improved and modified by other authors. 
Eventually it appears that there are two main types of argu- 
ment: the equilibrium proof and the dynamical proof. We shall 
first consider the equilibrium proof in some detail. 

Assume each molecule to be a mechanical system with co- 
ordinates (j l9 <7 2 ,,.. and momenta f° r which we write 

simply y,#, and with a Hamiltonian H(p,q). The interaction 
between the molecules will be neglected. The total number n 
and the total energy V of the assembly of molecules are 
given. 

In order to apply the laws of probability it is convenient to 
reduce the continuous set of points p 9 q in phase space to a dis- 
continuous enumerable set of volume elements. One divides 
the phase space into N small cells of volumes V } cd 2 V,... 9 oj^V, 
where V is the total volume; henco 

= 1 . ( 6 * 12 ) 

To each coll a value of the energy H(p,q) can be attached, say 
that corresponding to its centre; let these energies be € X , e 2 ,..., e N , 
Now supposo the particles distributed over the colls so that there 
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are % in the first cell, n 2 in the second, etc., but of course with 
the restriction that the totals 

= n, (6.13) 

% e l4‘ }l 2 6 2 - H-"”l -M W € iV = ^ (6.14) 

are fixed. Liouviile’s theorem suggests that the probability of a 
single molecule being in a given cell is proportional to its volume. 
Making this assumption, one has to calculate the composite 
probability P for any distribution n lt n 2 n N under the restric- 

tions (6.13) and (6.14). 

This is an elementary problem of the calculus of probability 
(see Appendix, 14) which can be solved in this way: First the 
second condition (6.14) is omitted; then the probability of a 
given distribution n v n z ,..., n N is 

P(n v n N ) = — oJH wf*,...a>Sr. (6-16) 

If this is summed over all n 2 ,...,n N satisfying (6.13), one ob- 
tains by the elementary polynomial theorem 

2 P(7fc 1} = (wi+w 2 +...+a)jy) 91 ' = 1, (6.16) 

because of (6.12) — as it must be if P is a properly normalized 
probability. 

It is well known that the polynomial coefficients n ! /% ! n 2 ! . . .n N ! 
have a sharp maximum for n x = n 2 = ... = n N \ that means, if 
all cells have equal volumes (a^ = cj 2 = ... = co^) the uniform 
distribution would have an overwhelming probability. Yet this 
is modified by the second condition (6.14) which we have now 
to take into account. The simplest method of doing this proceeds 
in three approximations which seem to be crude, but are perfectly 
satisfactory for very large numbers of particles (n -> oo). The 
first approximation consists in neglecting all distributions of 
comparatively small n 1 ,n 2 ,...,n N ; then the n k can be treated as 
continuous variables. The second approximation consists in 
replacing the exact expression (6.16) by its asymptotic value for 
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large n k by using Stirling’s formula log(»!) -»■ w(log7i — 1) (see 
Appendix, 14), and the result is 

logP = — Wilog^— » 2 log«. 2 — log MjvH- const. 

(6.17) 

The third approximation consists in the following assumption: 
the actual behaviour of a gas in statistical equilibrium is deter- 
mined solely by the state of ma xim um probability; all other 
states have so little chance to appear that they can be neglected. 

Hence one has to determine the maximum of logP given by 
(6.17) under the conditions (6.13) and (6.14). Using elementary 
calculus this leads at once to 

n k = (6.18) 

where a and j8 are two constants which are necessary in order to 
satisfy the conditions (6.13), (6.14). Yet these constants play a 
rather different part. 

If one has to do with a mixture of two gases A and B with given 
numbers and n 1 - 7 ^, one gets two conditions of the type (6.13) 
but only one of the type (6.14), expressing that the total energy 
is given. 

Hence one obtains 

»fc*> = e« U) -P'*\ 4 B > = (6.19) 

with two different constants and but only one /?. There- 
fore j8 is the parameter of thermal equilibrium between the two 
constituents and must depend only on temperature. 

Indeed, if one now calculates the mean energy U and the 
mean pressure p , one oan apply thermodynamics and sees 
easily that the second law is satisfied if 

= ( 6 - 20 ) 

where T is the absolute temperature and k a constant, called 
Boltzmann’s constant. At the same time it appears that the 
entropy is given by 

S = HogP = — fc2> a logtt a . (6.21) 

a 

All those results are main ly due to Boltzmann ; in partic ular (6.18) 
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is called Boltzmann’s distribution law. It obviously contains 
Maxwell’s law (6.11) as a special case, namely for mass points. 

We have now to ask: Is this consideration which I called the 
equilibrium proof of the distribution law really satisfactory ? 

One objection can be easily dismissed, namely that the 
approximations made are too crude. They can be completely 
avoided. Darwin and Fowler have shown that one can give a 
rigorous expression for the mean value of any physical quantity 
in terms of complex integrals, containing the so-called ‘partition 
function’ (see Appendix 15) 

<D 1 z €l +a) 2 2 € *+—+ <o n z € * = F(z). (6.22) 

No distribution is neglected and no use is made of the Stirling 
formula. Yet in the limit n oo, all results are exactly the same 
as given by the Boltzmann distribution function. Although this 
method is extremely elegant and powerful, it does not introduce 
any essential new feature in regard to the fundamental question 
of statistical mechanics. 

Another objection is going deeper: can the molecules of a gas 
really be treated as independent ? 

There are numerous phenomena which show they are not, 
even if one considers only statistical equilibrium. For no real 
gas is ‘ideal’, i.e. satisfies Boyle’s law rigorously, and the devia- 
tions increase with pressure, ending in a complete collapse, con- 
densation. This proves the existence of long-range attractive 
forces between the molecules. The statistical method described 
above is unable to deal with them. The first attempt to correct 
this was the celebrated theory of van der Waals, which was 
followed by many others. I shall later describe in a few words 
the modem version of these theories, which is, from a certain 
standpoint, rigorous and satisfactory. 

More serious are the interactions revealed by non-equilibrium 
phenomena: viscosity, conduction of heat, diffusion. They can 
all be qualitatively understood by supposing that each molecule 
has a finite volume, or more correctly that two molecules have a 
short-range repulsive interaction which prohibits a close 
approach. This assumption has the consequence that there 
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exists an effective cross-section for a collision, hence a mean free 
path for the straight motion of a molecule. The coefficients of 
the three phenomena mentioned can be reduced, by elementary 
considerations, to the free path, and the results, as far as they go, 
are in good agreement -with observations. 

All this is very good physics producing in a simple and intuiti vo 
way formulae which give the correct order of magnitude of 
different correlated effects. 

But for the problem of a rigorous kinetic theory, which takes 
account of the interactions and is valid not only for equilibria, 
but also for motion, these considerations have only the value of 
a preliminaiy reconnoitring. The question is: flow can one 
derive the hydrodynamical equations of visible motion together 
with the phenomena of transformation and conduction of heat 
and, for a mixture, of diffusion ? 

This is an ambitious programme. For such a theory must 
include the result that a gas left to itself tends to equilibrium. 
Hence it must lead to irreversibility, although the law's of 
ordinary reversible mechanics are assumed to hold for the mole- 
cules. How is this possible ? Further, is the equilibrium obtained 
in this way the same as that derived directly, say by the method 
of the most probable distribution ? 

To begin with the last question. Its answer represents what I 
have called above the dynamical proof of the distribution la w 
for equilibrium. 

The formulation of the non-equilibrium theory of gases is due 
to Boltzmann. One can obtain his fundamental equation by 
generalizing one of the equivalent formulae ((5.-1 ) or ((>.7). These 
are based on the assumption that each molecule moves indepen- 
dently of the others according to the laws of mechanics, and they 
describe how the distribution/ of an assembly of such particles 
develops m time. Now the assumption of independence is 
dropped, hence the expression on the left-hand side of (U.4) or 
(6.7) is not zero; denoting by /(l) the probability density for a 
certain particle 1, one can write 


dfi}) 

dt 


mo 




0 ( 0 . 
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where 0(1) represents the influence of the other molecules on the 
particle 1; it is called the ‘collision integral’, as Boltzmann cal- 
culates it only for the case where the orbit of the centre of a 
particle can be described as straight and uniform motion 
interrupted by sudden collisions. For this purpose a new and 
independent application of the laws of probability is made by 
assumingthatthe probability ofacollision between two particles 1 
and 2 is proportional to the product of the probabilities of finding 
them in a given configuration, /( 1 )/( 2) . If one then expresses that 
some molecules are thrown by a collision out of a given element 
of phase space, others into it, one obtains (see Appendix, 16) 

0(1) = JJ {f'(l)/'(2)-/(l)/(2)}|5 1 -5 a | dbd^ (6.24) 

where/(2) is the same function as/(l), but taken for the particle 
2 as argument. /( 1), /( 2) refer to the motion of two particles 
.‘before’ the collision, /'(l), /'(2)to that ‘after’ the collision; one 
has to integrate over all velocities of the particle 2, (d| 2 ), and over 
the ‘cross-section’ of the collision, (db), which I shall not define 
in detail. ‘Before’ and ‘after’ the collision mean the asymptotic 
straight and uniform motions of approach and separation; it is 
dear that if the former is given, the latter is completely deter- 
minedfor any law of interaction force — it is the two-body problem 
of mechanics. Hence the velocities of both particles TE^ after 
the collision are known functions of those before the collision 
and (6.23) assumes the form of an integro-differential 
equation for calculating/. 

This equation has been the object of thorough mathematical 
investigations, first by Boltzmann and Maxwell, and later by 
modem writers. Hilbert has indicated a systematic method of 
solution in which each step of approximation leads to an integral 
equation of the normal (so-called Fredholm) type. Enskog and 
Chapman have developed this method, with some modifications, 
m detail. There is an admirable book by Chapman and Cowling 
which represents the whole theory of non-homogeneous gases as 
a consequence of the equation (6.23). I can only mention a few 
points of these important investigations. 

The first is concerned with the question of equilibrium. Does 
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the equation (6.23) really indicate an irreversible approach from 
any initial state to a homogeneous equilibrium ? This is in fact 
the case, and a very strange result indeed: the metamorphosis 
of reversible mechanics into irreversible thermodynamics with 
the help of probability. But before discussing this difficult 
question, I shall indicate the mathematical proof. 

From the statistics of equilibrium it is known how the entropy 
is connected with probability, namely by equation (6.21). 
Replace here the discontinuous n k by the continuous / and 
summation by integration over the phase space, and you obtain 

S = — Jc jf(l)logf(l)dqdp. (6.26) 

If one now calculates the time derivative dS/dt by substituting 
df(l)jdt from (6.23), and assuming no external interference, one 
finds (see Appendix, 17) 



where the = sign holds only if/(l) is independent of the space 
coordinates and satisfies, as a function of the velocities, 

/d)/(2)=/'(l)/'(2) (6.27) 

identically for any collision. 

The result expressed by (6.26) is often quoted as Boltzmann’s 
H-theorem (because he used the symbol H for — S/k). Boltz- 
mann claimed that it gave the statistical explanation of 
thermodynamical irreversibility. 

Equation (6.27) is a functional equation which determines f 
as a function of ‘collision invariants’, like total energy and total 
momentum. If the gas is at rest as a whole, the only solution of 
(6.27) is Maxwell’s (or Boltzmann’s) distribution law: 

/ = e“~0«, H(p, q) = e. (6.28) 

This is what I called the dynamical proof, and is amost remarkable 
result indeed; for it has been derived from the mechanism of 
collisions, which was completely neglected in the previous 
equilibrium methods. This point needs elucidation. 

Before doing so, let mo mention that the hydro-thermal 
equations of a gas, i.e. the equations of continuity, of motion and 
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of conduction of heat, are obtained from Boltzmann’s equation 
(6.23) by a simple formal process (multiplication with 1,| and 
\rriX? followed by integration over all velocities) in terms of the 
stress tensor T— you will remember Cauchy’s general formula 
(4.9)— which itself is expressed in terms of the distribution 
function/. To give these equations a real meaning, one has to 
expand/in terms of physical quantities, and this is the object of 
the theories contained in Chapman and Cowling’s book. In this 
wayavery satisfactory theory of hydro-thermodynamicsof gases, 
including viscosity, conduction of heat, and diffusion, is obtained. 

STATISTICAL MECHANICS 

I remember that forty years ago when I began to read scientific 
literature there was a violent discussion raging about statistical 
methods in physics, especially the 17-theorem. The objections 
raised have been classified into two types, one concerning reversi- 
bility, the other periodicity. 

Loschmidt, like Boltzmann, a member of the Austrian school, 
formulated the reversibility objection in this way: by reversing 
all velocities you get from any solution of the mechanical 
equations another one — how can the integral S, which depends 
on the instantaneous situation, increase in both cases ? 

The periodicity objection is based on a theorem of the great 
French mathematician Henri Poincar6, which states that every 
mechanical system is, if not exactly periodic, at least quasi- 
periodic. This follows from Liouville’s theorem according to 
which a given region in phase space moves without change of 
volume and describes therefore a tube-shaped region of ever 
increasing length. As the total volume available is finite (it is 
contained in the surface of maximum energy), this tube must 
somewhere intersect itself, which means that final and initial 
states come eventually near together. 

Zermelo, a German mathematician, who worked on abstract 
problems like the theory of Cantor’s sets and transfinite num- 
bers, ventured into physics by translating Gibbs’s work on 
statistical mechanics into German. But he was offended by the 
bgical imperfections of this theory and attacked it violently. 

e used in particular Poincare’s theorem to show how scanda- 
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lous the reasoning of the physicists was: they claimed to have 
proved the irreversible increase of a mechanical quantity for 
a system which returns after a finite time to its initial state with 
any desired accuracy. 

These objections were not quite futile, as they led two dis- 
tinguished physicists, Paul Ehrenfest and his wife Tat j ana, to 
investigate and clear up the matter beyond doubt in their well- 
known article in vol. iv of the Mathematical Encyclopedia . 

To-day we hardly need to follow all the logical finesses of this 
work. It suffices to point out that the objections are based on the 
following misunderstanding. If we describe the behaviour of the 
gas (we speak only of this simple case, as for no other case has 
the //-theorem been proved until recently) by the equation (6.4), 
taking for H the Hamiltonian of the whole system, a function of 
the coordinates and momenta of all particles, then / is indeed 
reversible and quasi-periodic, no //-theorem can be proved. 

Boltzmann’s proof is based not on this equation, but on (6.23), 
where now H is the Hamiltonian of one single molecule un- 
disturbed by the others, and where the right-hand term is not 
zero but equal to the collision integral C( 1 ). The latter is taken as 
representing roughly the effect of all the other molecules; 
roughly’, that means after some reasonable averaging. T his 
averaging is tho expression of our ignorance of the actual micro- 
scopic situation. Boltzmann’s theorem says that this equation 
mixing mechanical knowledge with ignorance of detail leads to 
irreversibility. There is no contradiction between the two state- 
ments. 

But there rises the other question whether such a modification 
of tho fundamental equation is justified. We shall see presently 
that it is indeed, in a much wider sense than that cl aim ed by 
Boltzmann, namely not only for a gas, but for any substance 
which can be described by a mechanical model. We have there- 
fore now to take up the question of how statistical methods can 
be applied to general mechanical systems. Without such a 
theory, one cannot even treat the deviations from the so-called 
ideal behaviour of gases (Boyle’s law), which appear at high 
pressure and low temperature, and which lead to condensation. 
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Theories like that of van der Waals have obviously only- pre- 
liminary character. What is needed is a general and. ’w&Q" 
founded formalism covering the gaseous, liquid, and solid state** 3 ’ 
under all kind of external forces. 

For the case of statistical equilibrium, this formalism 
supplied by Willard Gibbs’s celebrated book on Statistic cd 
■Mechanics (1901), which has proved to be extremely successf*-*"^ 
in its applications (see Appendix, 18). The gist of Gibbs’s idea 
is to apply Boltzmann’s results for a real assembly of many 
equal molecules to an imaginary or ‘virtual’ assembly of many 
copies of the system under consideration, and to postulate tb.a-'h 
the one system under observation will behave Hire the averag® 
calculated for the assembly. Before criticizing this assumption* 
let us have a glimpse of Gibbs’s procedure. He starts from 
Iiouville’s theorem (6.4) and considers especially the case of 
equilibrium where the partition function/ of his virtual assembly 
has to satisfy equation (6.9). He states that/ = $(H) is a soln- 
tion (as we have seen) and he chooses two particular forms of 
the function <D. The first is 

/= $(H) = const., if E <H < E+AE, 

= 0 outside of this interval, ’ ' 

where E is a given energy and A E a small interval of energy. 
(In modem notation one could write 0(H) = 8 (H—E), where 
8 is Dirac s symbolic function.) The corresponding distribution, 
he calls micro-canonical. 

The second form is just that of Maxwell— Boltzmann, 

/ = e“-£®, E(p,q) — E, (6.30) 

andthe corresponding distributions called canonical. Gibbs shows 
that both assumptions lead to the same results for the averages 
of physical quantities. But the canonical is preferable, as it is 
simpler to handle. £ turns out again to be equal for systems in 
thermal equilibrium; if one puts ft = IjlcT the formal relations 
between the averages constructed with (6.30) are a true replica 
of thermodynamics. For instance, the normalization condition 
for the probability 

J / dpdq = | e a ~P s dpdq = 1 


( 6 . 31 ) 



STATISTICAL MECHANICS 


61 


can be written 

F = oc/fi = kT log Z , Z = JJ e-BWdpdq. (6.32) 

This JP plays the part of Helmholtz’s free energy. The integral 
Z i to-day usually called the ‘partition function’, depends, apart 
from the energy E, on molar parameters like the vol um e V , 
All physical properties can be obtained by differentiation, e.g. 
entropy S and pressure p by 

O c)F dF 

S =-8T’ P= ~W (03) 

This formalism has been amazingly successful in treating thermo- 
mechanical and also thermo-chemical properties. For instance, 
the theory of real (non-ideal) gases is obtained by writing 

E = H(p,q) = K(p)+ 17(g), (6.34) 

where K is the kinetic and U the potential energy; the latter 
depends on the mutual interactions of the molecules. As K is 
quadratic in the p, the corresponding integration in Z is easily 
performed and the whole problem reduces to calculating the 
multiple integral 

Q = / - / dq 1 dq z ...dq N . (6.36) 

Still, this is a very formidable task, and much work has been 
spent on it. I shall mention only the investigations initiated by 
Ursell, and perfected by Mayer and others, the aim of which was 
to replace van der Waals’s semi-empirical equation of state by 
an exact one. In fact one can expand Q into a series of powers of 
V- 1 and, introducing this into (6.32) and (6.33), one obtains the 
pressure p as a similar scries 

FTL A B \ 

^ y y y^~y 2 (6.36) 

where the coefficients >1,1?,..., called virial coefficients, are 
functions of T . One can even go further and discuss the process 
of condensation, but the mathematical difficulties in the treat- 
ment of the liquid state itself are prohibitive. 

The range of application of Gibbs’s theory is enormous. But 
reading his book again, I felt the lack of a deeper foundation. 
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A few years later (1902, 1903) there appeared a series of papers 
by Einstein in which the same formalism was developed, 
obviously quite independently, as Maxwell and Boltzmann are 
quoted, but not Gibbs; these papers contained two essential 
improvements: an attempt to justify the statistical assumptions, 
and an application to a case which at once transformed the kinetic 
theory of matter from auseful hypothesis into something very real 
and directly observable, namely, the theory of Brownian motion. 

Concerning the foundation, Einstein used an argument which 
Boltzmann had already introduced to support his distribution 
law (6.18) — though this seems to be hardly necessary, as for a 
real assembly the method of enumerating distributions over 
cells is perfectly satisfactory. Curiously enough, this argument 
of Boltzmann is based on a theorem similar to Poincare’s con- 
siderations on quasi-periodicity with which Zermelo intended 
to smash statistical mechanics altogether. Einstein considers a 
distribution of the micro-canonical type, in Gibbs’s nomen- 
clature, where only one ‘energy surface’ H(p,q) = E in the phase 
space is taken into account. The representative point in phase 
space moves always on this surface. It may happen that the 
whole surface is covered in such a way that the orbit passes 
through every point of the surface. Such systems are called 
ergodic; but it is rather doubtful whether they exist at all. 
Systems are called quasi-ergodic where the orbit comes near to 
every point of the energy surface; that this happens can be seen 
by an argument s imila r to that which leads to Poincare’s theorem 
of quasi-periodicity. Then it can be made plausible that the total 
time of sojourn of the moving point in a given part of the energy 
surface is proportional to its area; hence the time average of any 
function of p, q is obviously the same as that taken with the help 
of a micro-canonical virtual assembly. In this way quasi- 
periodicity is used to justify statistical mechanics, exactly 
reversing Zermelo’s reasoning. This paradox is resolved by the 
remark that Zermelo believes the period to be large and macro- 
scopic, while Einstein assumes it to be unobservably small. 
Who is right ? You may find the obvious answer for yourselves 
(see Appendix, 19). 
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Modem writers use other ways of establishing the foundations 
of statistical mechanics. They are mostly adaptations of the 
cell-method to the virtual assembly; one has then to explain why 
the average properties of a single real observed system can be 
obtained by averaging over a great many systems of the virtual 
assembly. Some say simply: As we do not know the real state, 
we have the right to expect the average provided exceptional 
situations are theoretically extremely rare — and this is of course 
the case. Others say we have not to do with a singlo isolated 
system, but with a system in thermal contact with its surround- 
ings, as if it were in a thermostat or heat bath; wo can then 
assume this heat bath to consist of a great many copies of the 
system considered, so that the virtual assembly is transformed 
into a real one. I think considerations of this kind are not very 
satisfactory. 

There remains the fact that statistical mechanics has justified 
itself by explaining a great many actual phenomena. Among 
these are the fluctuations and the Brownian motion to which 
Einstein applied his theory (sec Appendix, 20). To appreciate 
the importance of this step one has to remember that at that 
time (about 1900) atoms and molecules were still far from being 
as real as they are to-day — there were still physicists who did 
not believe in them. After Einstein’s work this was hardly 
possible any longer. Visible tiny particles suspended in a gas or a 
liquid (colloid solution) are tost bodies small enough to reveal 
the granular structure of the surrounding medium by their 
irregular motion. Einstein showed that the statistical properties 
of this movement (mean density, mean square displacement in 
time, etc.) agree qualitatively with the predictions of kinetic 
theory. Perrin later confirmed those results by exact measure- 
ments and obtained the first reliable value of Avogadro’s num- 
ber N, the number of particles per mole. From now on kinetic 
theory and statistical mechanics were definitely established. 

But beyond this physical result, Einstein ’s theory of Brownian 
motion had a most important consequence for scientific metho- 
dology in general. Tho accuracy of measurement depends on the 
sensitivity of tho instruments, and this again on the size and 
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weight of the mobile parts and the restoring forces acting on 
them. Before Einstein’s work it was tacitly assumed that 
progress in this direction was limited only by experimental 
technique. Now it became obvious that this was not so. If an 
indicator, like the needle of a galvanometer, became too small 
or the suspending fibre too thin, it would never be at rest but 
perform a kind of Brownian movement. This has in fact been 
observed. S imi l ar phenomena play a large part in modern 
electronic technique, where the limit of observation is given by 
irregular variations which can be heard as a ‘noise’ in a loud 
speaker. There is a limit of observability given by the laws of 
nature themselves. 

This is a st r i king example that the code of rules for inference 
by induction, though perhaps metaphysical in some way, is 
certainly not a priori , but subject to reactions from the know- 
ledge which it has helped to create. For those rules which taught 
the experimentalist how to obtain and improve the accuracy of 
his findin gs contained to begin with certainly no hint that there 
is a natural end to the process. 

However, the idea of u nlimi ted improvement of accuracy need 
not he given up yet. One had only to add the rule: make your 
measurements at as low a temperature as possible. For Brownian 
motion dies down with decreasing temperature. 

Yet later developments in physics proved this rule also to be 
ineffective, and a much more trenchant change in the code had 
to be made. 

But before dealing with this question we have to finish our 
review of statistical methods in classical mechanics. 

GENERAL KINETIC THEORY 

Kinetic theory could only be regarded as complete if it 
applied to matter in (visible) motion as well as to equilibrium. 
But if you look through the literature you will find very little— 
a few simple cases. The most important of these, the theory of 
gases, has been dealt with in some detail. Two others must bo 
mentioned : the theory of solids and of the Brownian motion. 

Ideal solids are crystal lattices or gigantic periodic molecules. 
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But only for zero temperature are the atoms in regularly spaced 
equilibrium positions; for higher temperatures they begin to 
vibrate. As long as the amplitudes are small, the mutual forces 
will be linear functions of them; then the vibrations can be 
analysed into ‘normal modes’, each of which is a wave r unni ng 
through the lattice with a definite frequency. These normal 
modes represent a system of independent harmonic oscillations 
to which Gibbs’s method of statistical mechanics can be applied 
without any difficulty. If, however, the temperature rises, the 
amplitudes of the vibrations increase and higher terms appear in 
the interaction: the waves are scattering one another and are 
therefore strongly damped. Hence there exists a kind of free 
path for the transport of energy which can be used for expl aining 
conduction of heat in crystals (Debye). Similar considerations, 
applied to the electrons in metallic crystals, are used for the 
explanation of transport phenomena like electric and thermal 
conduction in metals. 

In the case of Brownian motion, I have already mentioned that 
Einstein calculated not only the mean density of a colloidal 
solution, say, under gravity, but also the mean square displace- 
ment of a single suspended particle in time (or, what amounts to 
the same, the dispersion of a colloid by diffusion as a function 
of time). The simplifying assumption which makes this possible 
is that the mass of the colloidal particle is large compared with the 
mass of the surrounding molecules, so that these impart only 
small impulses. Similar considerations have been applied to 
other fluctuation phenomena (see Appendix, 20). 

A great number of more or less isolated examples of non- 
equilibria have been treated by a semi-empirical method which 
uses the notion of relaxation time. You find a very complete 
account of such things related to solids and liquids in a book of 
J . Frenkel, Kinetic Theory of Liquids . But you must not expect 
to find in this work a systematic theory, based on a general idea, 
nor will you find it in any other book. 

My collaborator Dr. Green and I have tried to fill this gap, 
and to develop the kinetic theory of matter in general. I hope 
you will not mind if I indulge a little in the pleasure of expl ainin g 
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the leading ideas. It will help the understanding of the interplay 
of cause and chance in the laws of nature. 

We have to remember the general principles laid down by 
Gibbs which he, however, used only for the case of statistical 
equilibrium. 

An arbitrary piece of matter, fluid or solid, is, from the atomis- 
tic standpoint, a mechanical system of particles (atoms, mole- 
cules) defined by a Hamiltonian H. Its state is completely 
determined if the initial values of coordinates and momenta are 
given. Actually this is not the case; but there is a probability 
(as yet unknown) f°(p, q)dpdq for the initial distribution. The 
causal laws of motion demand that the distribution f(t 9 p,q) at 
a later time t is a solution of the Liouville equation (6.4) (p. 49) 

| ( 8 - 37 ) 


namely that solution which for t = 0 becomes f°, 

Mp,q)=f°(p,q). (6.38) 


Let us assume for simplicity that all molecules are equal 
particles (point masses) with coordinates x® and velocities 
I*® = p (fc) /m. We shall consider /to be a function of these and 
write f(t, x, |). If we want to indicate that afunction /depends on h 
particles, we do not write all arguments, but simply /*( 1, 2,...h) or 
shortly /*. As allthe particles are physically indistinguishable, we 
can assume all the functions/* to be symmetrical in the particles. 

ow the physicist is not direotly interested in a symmetric 
solution/*, of (6.37). He wants to know such things as the num- 
ber density (number of particles per unit volume) n^t, x) at a 
given point x of space, or perhaps, in addition, the velocity dis- 
- 11 ° n x ’ ©* M those quantities which are familiar 


n v C -r eor y ° f § ases j depending on one particle only. 

W 6 7 ^ 6rf ? re t0 redUOe tke functi °n/iv for 2 ? particles step 
by step t° the function A of one particle. 

ooTt^'l 0116 ty 5 tegrating 0Terthe Position and velocity of 
operator ^ ^ ^ ° ne ’ the hel P of the integral 

Xe "‘ = JJ • (6.39) 
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If/a+x » given, we obtain f a by applying the operator v„ +1 • it is 
however, convenient to add a normalizing factor and to write ’ 

2)fq Xq +1 f 2 + 1 - (6.40) 

The physical meaning of the operation is this: we give np the 
pretence to know the whereabouts of one particle and declare 
frankly our ignorance. By repeated application of the operation 
we obtain a chain of functions ? 

fjv> /n-1, .... f 2 , f x , ( 6 . 41 ) 

to which one can add/ 0 = 1 ;/ f means the probability of finding 
the system in a state where q particles have fixed positions (i.e. 
lie in given elements). The normalization is such that 

/A ft 5®) = nx(t, x«>) (6 . 4 2) 

is the number density; for one has 

/ nx(t, x<») dx 0) = ff fxdxP^Oi = xJi = N> (6 43) 
where the last equality follows from (6.40) for q = 0 (with 

Now we have to reduce the fundamental equation (6.37) step by 
step by repeatedly applying the operator x (see Appendix, 21). 
Assuming that the atoms are acting on another with central 
forces, being the potential energy between two of them the 
result of the reduction is a chain of equations of the form ' 

~St ~ (? = 1, 2,..., N), (6.44) 

where S Q (6.46) 

This quantity S v will bo callod the statistical term. What is the 
advantage of this splitting up of the problem into the solution of 
a chain of equations ? The first impression is that there is no 
advantage at all; for to determine/! you need to know S lt but & 
contains / 2) and this again depends on / 3 , and so on, so that one 
eventually arrives at/*, which satisfies the original equation. 
Yet this reasoning supposes the desire to get information about 
every detail of the motion, and that is just what wo do not want. 
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We 'wish, to obtain some observable and rather crude averages. 
Starting from/j and climbing up to/ a ,/ 8 ,..., we can soon stop, as 
the chaos increases with the number of particles, and replace the 
rigorous connexion between f g and / a+1 by an approximate one, 
according to the imperfeotion of observation. 

Before explaining the application of this ‘method of ignorance* 
to simple examples,. I wish to mention that we have actually 
found the chain of equations (6.44) in quite a different way, 
starting with/j and using the calculus of probability for events 
not independent of one another (see Appendix, 22). 

This derivation is less formal than the first one and illuminates 
the physical meaning of the statistical term. 

It would now be very attractive to show how from this general 
formula (6.44) the mechanical and thermal laws for continuous 
substances can be derived. But I have to restrict myself to a 
few indications concerning the general ‘method of ignorance’, 
to which I have already alluded. 

The first example is the theory of gases. We have seen that 
this theory is based on Boltzmann’s equation (6.23) 

ffl = [ff,/(l)]+(7(l), (6.46) 

where 0(1) is the collision integral (6.24) ; 

C (!) = J [/'(l)/ , (2)-/(l)/(2)]|5 1 -5 2 | dbd% 2 . (6.47) 

Now (6.46) has the same form as our general formula (6.44) for 
2=1, provided 0(1) can be identified with S v 

Green has shown that this is indeed the case, provided that the 
molecular forces have a small range r 0 ; then one can assume that 
in the gaseous state the probability of finding more than two 
particles in a distance smaller than r 0 is negligible. In other 
words, one can exclude all except ‘binary’ encounters. Two 
particles outside the sphere of interaction can be regarded as 
independent; hence one has there 

/s(l,2) =/i(l)/i(2). (6.48) 

This hoWs also, in virtue of Iiouvfile’s theorem, if on the left- 
hand side the positions and velocities refer to a point in th 
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interior of the sphere of action while on the right-hand side the 
values on its surface are used. With the help of this fact, the 
integration in S 1 can be performed (see Appendix, 23), and 
leads exactly to the expression (7(1), in which only the ‘boundary 
values’ of the functions/(l) and/(2) on the surface of the sphere 
of action appear. 

Hence the whole kinetic theory of gases is contained as a 
special case in our theory. 

Concerning liquids, one must proceed in a different way, 
because triple and higher collisions cannot be handled with 
elementary formulae. We have adopted a method suggested by 
the American physicist Kirkwood. His formula is a generali- 
zation of (6.48), namely 


3) 


/i(». »)/«(»> l)/«(l.g) 

/i(l)A(2)/i(3j ’ 


(0.40) 


and may be interpreted in different ways, e.g. by saying that the 
occurrences of three pairs of particles (2, .'!), (3, 1 ), ( 1 , 2) at given 
positions and with given velocities are almost, independent 
events, because the mutual interactions decrease rapidly with 
the distance. 

Substituting f s from (6.49) in S 2 , one obtains from (6.44), 
(6.45) two integro-difforontial equations for/j and f 2 which form 
a closed system and can bo solved by suitable approximations. 
(If then / 3 is calculated from the solution /„/ a , with the help of 
(6.49), the relation (6.40) for <j 3 is not necessarily satisfied; 
this is the sacrifice of accuracy introduced by the Kirkwood 
method.) 

All physical properties of a liquid of the kind discussed hero 
(particles with central forces) can be expressed in terms of 
n 2 (l, 2), afunction known to the experimenters iti X -ray research 
on liquids as the radial distribution function. The method 
explained loads to explicit formulae for the equation of state 
and the energy; it allows also a discussion of the singularity 
which separates the gaseous and liquid states. Hut 1 cannot 
enter into a discussion of details. 

Concerning non-equilibria, one can obtain f lu*, differential 
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equations for the mechanical and thermal flow in a rigorous way; 
the result has, of course, the form of Cauchy’s equations (4.9) 
for continuous media, yet with a stress tensor T a p which can be 
explicitly expressed in terms of the time derivatives of the strain 
tensor (or the space derivative of the velocity) and the gradient 
of temperature. In this way expressions for the coefficients of 
viscosity and thermal conductivity are obtained. They differ 
from the known formulae for gases by the great contribution of 
the mutual forces. Yet again I cannot dwell on this subject 
which would lead us far from the main topic of these lectures, 
to which I propose now to return (see Appendix, 33). 
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What can we learn from all this about the general problem of 
cause and chance? The example of gases has already shown us 
that the introduction of chance and probability into the laws of 
motion removes the reversibility inherent in them; or, in other 
words, it leads to a conception of time which has a definite 
direction and satisfies the principle of antecedence in the 
cause-effect relation. 

The formal method consists in defining a certain quantity, 
the entropy r 

° — —K , 

jfdpdq 


(7.1) 


and showing that it never decreases in time : dS/dt 0. In the 
case of a gas, the function / was the distribution function f x of 
one single molecule, a function of the point p,q of the phase 
space of this molecule. 

The same integral represents also the entropy of an arbitrary 
system in statistical mechanics, if f is replaced by f N} the 
distribution function in the SiV-dimonsional phase space; it 
satisfies all equilibrium relations of ordinary thermodynamics. 

In the case of a gas, the time derivative of 8 could bo deter- 
mined with the help of Boltzmann’s collision equation, and it 
was found that always 


dS 

dt 


> 0 . 


(7.2) 


I have stressed the point that this is not in contradiction to 
the reversibility of mechanics; for this reversibility refers to a 
distribution function of non-interacting molecules, satisfying 

| -[*./]. (7.3) 

while molecules colliding with one another satisfy 


(7.4) 
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where C is the collision integral. Irreversibility is therefore a , 
consequence of the explicit introduction of ignorance into the 
fundamental laws. 

Now the same considerations hold for any system. If we take 
for / the function f N of a closed system of N particles, (7.3) is 
again satisfied, and if its solution is introduced into (7.1), it can 
be easily shown that dSJdt = 0. 

Irreversibility can be understood only by explicitly exempting 
a part of the system from causality. One has to abandon the 
condition that the system is closed, or that the positions and 
velocities of all particles are under control. The remarkable 
thing is that it suffices to assume one single particle beyond con- 
trol. Then we have to do with a system of JV-f 1 particles, but 
concentrate our interest only on N of them. The partition func- 
tion of these N particles satisfies the equation (6.44) for q = N: 


^ — \Mi vJn]+S n , (7.6) 


wheredtf is a certain integral overf N+1 given by (6.46) for q = N. 
For a solution of this equation (7.6) the entropy is either constant 
or increasing. This is of course o fortiori the case if the system 
of N particles is coupled to more complicated systems out of 
control (see Appendix, 24). 

The increase of S continues until statistical equilibri um is 
reached, and it can be shown that the final distribution is the 
canonical one 

f N = e“-0®, E(p,q) = E. (7.6) 


Thti result is, in my opinion, the final answer of the age-old 
question how the reversibility of classical mechanics and the 
irreversibility of thermodynamics can be reconciled. The latter 
IB due to a deliberate renunciation of the demand that in prin- 
ciple the fate of every single particle can be determined. You 
must violate mechanics in order to obtain a result in obvious 
contradiction to it. But one may say: this violation may be 
necessary from practical reasons because one can neither observe 
all particles nor solve the innumerable equations-in reality, 
however, the world is reversible, and thermodynamics only a 
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trick for obtaining probable, not certain, results. This is the 
standpoint taken in many presentations of statistical mechanics. 
It is difficult to contradict if one accepts the axiom that the 
positions and velocities of all particles can, at least in principle, 
be determined — but can this really be maintained? We have 
seen that the Brownian motion sets a limit to all observations 
even on a macroscopic scale. One needs a spirit who can do 
things we could not even do with infinitely improved technical 
means. Further, the idea of a completely closed system is also 
almost fantastic. 

I think that the statistical foundation of thermodynamics is 
quite satisfactory even on the ground of classical mechanics. 

But in fact, classical mechanics has turned out to be defective 
just in the atomic domain where we have applied it. The whole 
situation has therefore to be re-examined in the light of quantum 
mechanics. 
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In order not to lose sight of my main subject I have added to the 
heading of each section of these lectures words like ‘cause’, 
‘contiguity’, ‘antecedence’, ‘chance’. The one for the present 
section, ‘matter’, seems to be an intruder. For classical philo- 
sophy teaches that matter is a fundamental conception of a 
specific kind, entirely different from cause, though on the same 
level in the hierarchy of notions: another ‘category’ in Kant’s 
terminology. This doctrine was generally accepted at the tim e 
before the great discoveries were made of which I have now to 


speak. It was the period when physics was governed by the 
dualism ‘force and matter’, Kraft und Staff (the title of a popular 
book by Buchner). In modem physics this duality has become 
vague, almost obsolete. The first steps in this direction have been 
described in the preceding survey: the transition from Newton’s 
distance forces to contact forces, first in mechanics, then in 
electromagnetism, and finally for gravitation; in other words, 
the victory of the idea of contiguity. If force is spreading in 
empty space with finite velocity, space cannot be quite empty; 
there must be something which carries the forces. So space is 
ffled with ether, a kind of substance akin to ordinary matter 
in many respects, in which strains and stresses can be produced. 
Though these contact forces obey different laws from those which 
govern elasticity, they are still forces in an ether, something 
different from the carrier. Yet this distinction vanishes more 
and more. Relativity showed that the ether does not share with 
ordinary matter the property of ‘localization’: you cannot say 
here I am’; there is no physical way of identifying a point in the 
as you could recognize a point in running water by a 
little mark, a particle of dust. Electric and magnetic stresses 
arenotsomeaungmtheether, they are ‘theether’. The question 
Ot a earner becomes meaningless. 

However, this is a question of interpretation. Physicists are 
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very broad-minded in this respeot ; they mil continue using obso- 
lete expressions like ether, and no harm is done. For them a 
matter of terminology is not serious until a new quantitative 
law is involved. That has happened here indeed. I refer to the 
law connecting mass m, energy e, and the velocity of light c 
(see Appendix, 25), # = /« n 


■which, after having been found to hold in special cases, was 
generally established by Einstein. His reasoning is based on the 
existence of the pressure of light, demonstrated experimentally 
and also derivable from Maxwell’s equations of electrodynamics. 
If a body of mass M emits a well-defined quantity of light in a 
parallel beam which carries the total electromagnetic energy e, 
it suffers a recoil corresponding to the momentum c/c transferred. 
It therefore moves in the opposite direction, and to avoid a clash 
with the mechanical law that the centre of mass of a system 
cannot he accelerated by an internal process, one has to ascribe 
to the beam of light not only an energy e and momentum e/c but 
also amass e/c 2 , and to assume that the mass M of the emitting 
body is reduced by the same amount m = e/c 2 .* f* 

The theory of relativity renders this result quite natural. It 
provides, moreover, an expression for the dependence of mass 
on velocity; one has 


m — U 

vU -v*ic*y 


where m 0 is called the rest-mass. Energy e and momentum p are 
then given by € = mc ^ p = mv. (8.3) 


I need hardly to remind you how this result of ‘purest science’ 
has been lately confirmed by a terrifying, horrible, ‘technical’ 
application in Now Mexico, Japan, and Bikini. There is no 
doubt, matter and energy are the same. The old duality between 
the force and tho substance on which it acts, has to be aban- 
doned, and hence also tho original idea of force as tho cause of 
motion. We soo how old notions arc dissolved by new ex- 
periences. It is this process which has led mo to tho abstract 


t M. Born, Atomic Physh'x (Blaokio). 4th od. p 1048, Ch. in. 2, p. 62; A. VII, 
p. 288. Soo Appendix, 25. 
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definition of causality based only on the notion of physical 
dependence, but transcending special theories which change 
according to the experimental situation. 

Eetuming to our immediate object, we learn from Einstein’s 


law that the atomistic conception of matter is necessarily con- 
nected with the atomistic conception of energy. In fact the 
existence of quanta of energy was deduced by Planck from tho 
laws of heat radiation five years before Einstein published his 
relation between mass and energy. 

Planck s discovery opened the first chapter in tho history of 
quantum theory, which corresponds to the years 1900 to 1013 
and could be entitled ‘Tracing the quantum by thermodynamical 
and statistical methods’. The next chapter deals with the period 
1913-25 when spectroscopical and electronic methods were in 
the foreground, while the last chapter describes tho birth and 
development of quantum mechanics. 


I cannot possibly give an account of this long and tedious 
evelopment, but I shall pick out a few points which are not so 
well known and hardly found in text-books, beginning with some 
remarks on the thermo-statistical quantum hunt. 

., pyj 1 ®* 1 ^fcch Planck solved was the determination of 

the density of radiation P in equilibrium with matter of a given 
temperature T as function of T and of the frequency v, so that 
P(v, T) dv is the energy per unit volume in the frequency interval 

tMs fifJT 17 tte ™°dynamical methods several properties of 
thm function were known: the temperature dependence of tho 

total radiation j pdv = aT « (law of Stefan and Boltzmann) and 

^fedcatathat^is only a function of the quotient V /T ,\ f 

aa being one regards the radiation 

* 86 0t ‘ * Utefc inter, 
then the mean e„„„ t ”,*" 1 teplao6d Wionio oscilktora; 

wm the method LefaJd b, *° to it. Thi« 
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tion itself as a system of oscillators, each of these representing 
the amplitude of a plane wave. This method was used by Ray- 
leigh and later by Jeans. In both cases the relation between the 
mean energy u(v) of the oscillators of frequency v and the radia- 
tion density p is givep. by f 

877V 2 

p = -3-^, (8.4) 


and it suffices to determine u. 

This can be done with the help of the so-called equipartition 
law of statistical mechanics. Suppose the Hamiltonian H of a 
system has the form 

| (8.5) 

where £ is any coordinate or momentum and H' contains all 
the other coordinates and momenta but not £. Then the mean 
value of the contribution to the energy of this variable £ is (see 
Appendix, 26) — , 

= ( 8 . 6 ) 


independent of the constant a — lienee the same for all variables 
of that description. 

Applied to a set of oscillators of frequency v, whore 



H - i(f»*+4 ***¥). 

(8.7) 


2m 

one obtains for the 

average energy 



« k r l \ 

(8.8) 

hence, from (8.4), 

cr 

(8.9) 


This is called the Rayleigh- deans radiat ion formula. It is a 
rigorous consequence of classical statistical mechanics, but 
nevertheless in obvious contradiction to facts. It does not even 
lead to a finite total radiation, since p increases as j* a with fre- 
quency. The law is, however, not quite absurd as it. agrees well 
with measurements for small frequencies (long waves) or high 
temperatures. At the other end of the spectrum, the observed 

t 800 Atomic VhyalcHf (!h. VII. I, p. 201 ; A. XXVIII, p. !M7. 
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energy density decreases again, and Wien has proposed for tl 
region an experimental law which would correspond to tj 
assumption that in (8.4) the oscillator energy is of the form 

u = u 0 ( 8.1 

This looks very much like a Boltzmann distribution. Accordii 
to Wien’s displacement law it holds for hi gh values of tl 



quotiaat v/y, and both constants * 0 and e 0 must be proportional 
to v; but then meaning is obscure. 

This was the situation which Planck encountered- two 

Sis *•*-*• < s - s > ^ <m«. £ 

™ ^ energy 
-•» ~ a * *. SSS*- fiSSV* 

his reasoning in a little rUtW^f c rf ' 1 SJla11 here 

?r, 40 ®“««. 1905). 

of statistical mechanics are mh 's? ?* S“ formulae 

Used> Starting from Boltzmann’s 
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distribution law, according to which the probability of finding 
a system in a state with energy e is proportional to e~0®, where 
P = 1/kT, one can express the mean square fluctuation of the 
energy (Sip = (i=ip = 6 -a 

in terms of the average energy e = u itself if the latter is given as 
function of temperature or of p (see Appendix, 20.10): 

W— % MU 

Now this function u(P) is known for the two limiting cases: T 
large or )3 small, and T small or p large, from (8.8) and (8.10), 


u 


_ I /8 -1 for small p, 
{ u 0 e-fa 


for large /?. 


( 8 . 12 ) 


_ _du _ I /3~ 2 = »*, small p; 
dp ~ (tf 


Hence one has 

TSS?= -rr Urge p. < 8 - 13 ) 

Now Planck argues like this: the two limiting cases will corre- 
spond to the preponderance of two different causes, whatever 
they may be. A well-known theorem of statistics says that the 
mean square fluctuations duo to independent causos are additive. 
Let us assume that the condition of independence is here satisfied. 
Hence, if both causes act simultaneously, wo should have 


du 


(A*)* = ~=e 0 u+u*. 


(8.14) 


u = 


H 

e*H3e,_ 


This is a differential equation for u, with the general solution 

(8.15) 

The constant of integration a must vanish in order to have the 
limiting cases (8.15) all right. Wien’s displacement law, accord- 
ing to which p/v 3 — H-ttv,/c 3 v depends only on Tjv, loads then to 
e o — hv, where A is a constant, known sis Planck’s constant. 
The result is Planck s formula for the mean oscillator energy 

hv 1 

<«•>«> 

from which the radiation density follows according to (8.9); a* 
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refined interpolation which turned out to h« • 

meat ’with experiment that Planck looked fS S0 , ezceUent a « ree - 

taon a^d discovered it in the assumption tf 

fcute size * 0 = fa if ^ en ^ P ° f e * er g7 quanta of 
(6.32) has to be replaced by the sum ^ e € o> integral 


-Z = T e^feo n ___ 1 

n= =0 1 — erfieo * 


*- e (8.17) 

<«>a^S^^r(8 P ^)fo^'S d t in loads .too* 

P©rty of the atoms, represented w ™ n f f ergy was a Fo- 
with radiation, which itself h if seiUators m their interaction 

years later Eix^tho^^^ »<*“*%• Seven 

occur in atomic system « +U • m ^ eed wherever oscillations 

which opened more thafTorf*^ of molec ules and solids 

■ ^^2fSaST;r i7ed <jMad7 ta 1906 •* «- «• 

assumed, that the auantn W&S ^ &B innocent as Planck 
tion and ought to he i™ ^7 “ mtrinsi 0 Property of radia- 

about. In text-books this rfrfval^fMh^t^ ° f parfcicles ^ 

of light is connected I lr ? * corpuscular theory 
electric effect alTsi^! 7 ^ * expla u*tion of the photo- 
electrons ^^rodbc^lbvHff^ 16110 ^ 16114 ^ ^ * 

but not theirCorf * ? ™ 6 ^ ™- 18 correct, 

which led Einstein to the ^ a statistical argument 

Photons, ss ' OT °* 1 “ !sis «* «—» <* hght, or 

(81 . 3) *»“ • different 

heat radiation is a statisfci!LT aV * * lleory of %ht is correct, then 
directions freeman,/ tetlcaI uuxture of harmonic waves of all 

*» Beotion ^ * “° n to * 

& ™»d by the Dutoif physicist* HA ^°" 1 ‘fon h f 'f™ P 61 ' 
y ysxcist, H. A. Lorentz, with the result 

t See Atomic Physic*, Ch. VH. 2, p. 207. 
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tl»t K? - P for any fequenoy, or expressed in terns of His 
eqmvstart oscillators, JS? = *>, in agreement niti the 
Bayleigh-Jeans case (small p, large T) in (8.13). Hence there 
must be something else going on besides the waves, for which 
(Ae) a = e 0 u; what can this be? 

Suppose Planck’s quanta exist really in the radiation and let 
n be their number per unit volume and unit frequency interval 
As each quant um ha s the energy S() = hv, one has e = u = ’ 

and (Ae) = eg(A n) z . Hence the fluctuation law in Wien’s case 
(large p, small T) of (8.13) can be written as 


(Sn) 2 _ n. (8 .18) 

This is a well-known formula of statistics referring to the 
following situation: a great number of objects are distributed at 
random in a big volume and n is the number contained in a part 
Then one has just the relation (8.18) between the average n 
and its mean square fluctuation (see Appendix, 20). So Einstein 
was led to the conclusion that the Wien part of the fluctuation 
of energy is accounted for by quanta behaving like independent 
particles, and he corroborated it by taking into account, besides 
the energy, also the momentum hv/c of the quantum and the 
recoil of an atom produced by it. Tt was this result which en- 
couraged him to look for experimental evidence and led liim to 
the well-known interpretation of the photo-electric effect as a 
bombardment of photons which knock out electrons from the 
metal transferring their energy to them. 

Expressed in terms of photon numbers the combined fluctua- 
tion law (8.14) roads 


(A») a - - . " n - 1 - * =? n(n-\- 1), (8.1!)) 

with the general solution 

I 

V ‘ ~~ I pro _ j ’ (H.20) 

where a = 0 leads to the correct value for large T. But what if 

at 0? 

Every physicist glancing at the last formula will recognize 
it as tho so-called Bose -Einstein distribution law for an ideal 
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gas of indistinguishable particles according to quantum theory. 
It is most remarkable that at this early stage of quantum theory 
Planck and Einstein have already hit on a result which was 
rediscovered much later (Einstein again participa tin g) (see 
Appendix, 25, 32). In fact Planck’s interpolation can be inter- 


preted in modem terms as the first and completely successful 
attempt to bridge the gulf between the wave aspect and the 


particle aspect of a system of equal and independent components 
whatever they may be — photons or atoms. 


I shall conclude this section by giving a short account of 


another consideration of Einstein’s which belongs to a later 
period of quantum theory, when Bohr’s theory of atoms was 
already well established, namely the existence of stationary 
states in the atoms which differ by finite amounts of energy 
content. Suppose the atom can exist in a lower state 1 and a 
higher state 2; transitions are possible by emission or absorp- 
tion of alight quantum of energy e 2 — e x = e 0 , hence of frequency 
v = € 0 /h. On the other hand, according to Boltzmann’s law 
the relative number of atoms in the two states will be 


( 8 . 21 ) 

Now one can write (8.20), with « = 0, in the form 
(n+l)e~P" = n, 

or, using (8.21), = ^ (8JM) 

4116 following interpretation: 
tto . left-hand side repreeente the number of quanta emitted per 

an* of tune from the N, atoms in the higher state, the rigC 
1 “b^hed by the Nl atoms in the tower slL,, 

tuoproceeses winch in equiiibrium must of course turned one 

ato« *» —W »f 

present i e to nN rv^’ ^ ^ t0 th ® number * of photons 

"spo^^=“r ftetorm /-“ 

it comseond. to «,! of the presence of radiation ; 

omspoud. to the well-known emission of electromagnetic 
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waves by a vibrating system of charges. The other term nN 2 is 
a new phenomenon which was signalled the first time in this paper 
of Einstein (later confirmed experimentally), namely a forced 
or induced emission proportional to the number of photons 
present. 

If we denote the number of spontaneous emissions by AN 2i 
of induced emissions by B 21 N 2 n, of absorptions we 

learn from (8.22) that the probability coefficients (probabilities 
per unit time, per atom, and per light-quantum) are all equal: 

A = B^ 2 = B 2 i» (8.23) 

This result had far-reaching consequences. The first is the exis- 
tence of a symmetric probability coefficient B 12 = S 21 for transi- 
tion between two states induced by radiation. This became one 
of the clues for the discovery of the matrix form of quantum 
mechanics. 

The second point is seen if one considers, not equilibrium, but 
a process in time; Einstein’s consideration leads at once to the 
equation 

Tt = lT = - d -§ = (8.24) 

which is of the type used by the chemists for the calculation of 
reaction velocities. One has, in their terminology, three com- 
peting reactions, namely two diatomic ones and one monatomic 
one. Now genuine monatomic reactions are rare in ordinary 
chemistry, but abundant in nuclear chemistry; they were in 
fact until recently the only known ones, namely the natural 
radioactive disintegrations. If the radiation density is zero, 
n = 0, one has 

= AN 2 , (8.25) 

which is exactly the elementary law of radioactive decay, 
according to Rutherford and Soddy . It expresses the assumption 
that the disintegrations are purely accidental and completely 
independent of one another. 

Thus Einstein’s interpretation means the abandonment of 
causal description and the introduction of the laws of chance 
for the interaction of matter and radiation. 



84 


MATTER 


ELECTRONS AND QUANTA 

Although my programme takes me through the whole historv 

° P ’ * am weP aware that it is a very one-sided account of 
what roily has happaned. It will pot have escaped ZZ? 

believe progress in physics essentially due to the inductive 
me o (of which I hope to say a little more later), yet the experi- 
mentahst may rightly complain that his efforts and achieve 
mente are hardly mentioned. Yet as I am concerned ZflTI 

tn£~ ld ? S ^ Conceptions ’ 1 may be Permitted 
mventive « enius of the experimenters for 
th « ir “^<»ft'm y ppip M e^thopt detailed 

1900, Whel1 qmntum a««y sprang from 
the mvestigatxons of radiation, was also full of experimental 

tZZZ. -“*■ ^ “ d * he 

«w a r T rd th ® r ° le ° f cbance “ P h y 8ics - radioactivity was of 
special importance. As I said before, the law of decay is the 

xpression o independent accidental events. Moreover the 

tUmed 0Ut t0 be perfec % itive to S 
physica influences. There might be, of course, some internal 

Yet thl T °? the . at ° m wbicb dete rmine when it will explode. 

**”* from that ^ »» ^ »£* - 

^eTuntos^ rt Paramet6rS ’ ° r beUeVe We ^ them, they 

detE W if r x %e “ Statistics because of this lack of 
hSt idt Iha^h radi ° aCti ^> on the other hand, nobody 
lZ thGSe paiameter8 be, their nature itself 

this questi 0 rw O ni°7r er, i °"f bave toSJad the hope that 
to ordinary statist; *7 radioactive statistics reduced 

hlp^ 7 “ meChani0S - 111 fact > ** ^e opposite has 
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This result, together with J. J. Thomson’s discovery of the elec- 
tron, led to the planetary model of the atom: a number of elec- 
trons surrounding the nucleus, bound to it by electric forces. The 
fundamental difficulty of this model is its mechanical instability. 

As long as nothing was known about the forces which keep the 
elementary particles in an atom together one could assume a 
law of force which allowed stable equilibrium states. An in- 
genious model of this kind is due to J. J. Thomson. But now one 
knew that the forces were electrostatic ones, following Coulomb’s 
law, and these could never guarantee the extraordinary stability 
of the actual atoms which survive billions of collisions without 
any change of structure. Bohr connected this difficulty with the 
facts of spectroscopy, and the result was his well-known model 
of the atom consisting of ‘quantized’ electronic orbits. 

Mentioning spectroscopy, I feel again sadly how I have to 
skip over great fields of research with a few words. 

The discovery of simple laws in line spectra was in fact a great 
achievement. Still more important than numerical formulae, 
like the one discovered by the Swiss schoolmaster Balmer for 
the hydrogen spectrum, was the rule found by Ritz (also a 
Swiss, who unfortunately died quite young), the so-called com- 
bination principle; it says that the frequencies of the spectral 
lines of gases can bo obtained by forming differences of a single 
row of quantities T v T a , which are called terms: 


v nm -^n (8.26) 

though not all of these differences appear as lines in the spectrum. 
Balmer ’s formula for hydrogen is a special case where T n = JR/n 3 , 
namely 

— 2 J (m = 3,4,...). 


The formula (8.26) gave Bohr the clue to the application of 
quantum theory. Multiplying it by Planck’s constant h he 
interpreted it as the energy difference e nm = hv nm between any 
two stationary states having the energies e n = hT n (n = 1 , 2 ,...). 
This interpretation is a sweeping generalization of Planck’s 
original conception of discrete energy-levels of oscillators. It 
explained at once the stability of atoms against impacts with an 
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SmaJ ? r ? han a certain Cestoid, the difference between 
mission and absorption spectra (the latter being of the form 

a? -i * 1 ’ wbere 1 means the ground state), and was in 
e ai confirmed by the well-known experiments of Franck and 
JlCTtz (excitation of spectra by electron bombardment). 

However, I cannot continue to describe the whole develop, 
ment of quantum theory because that would mean writing an 
encyclopaedia of physics of the last thirty-five years. I have 
fjenthis short account of the initial period because it is fashion- 
able to-day to regard physics as the product of pure reason. 

lam not 80 reasonable as to say that physics could 
proceed by experiment only, without some hard thinking, nor 
o I deny that the forming of new concepts is guided to some 
degree by general philosophical principles. But I know from my 
own experience, and I could call on Heisenberg for confirmation, 
that the laws of quantum mechanics were found by a slow and 
tedious process of interpreting experimental results. I shall try to 

descnbethemaonstepsofthis process in the shortest possible way. 

Yet it must be remembered that these steps do not form a 
straight staircase upwards, but a tangle of interconnected alleys, 
xiowever, I must begin somewhere. 

There was first the question whether the stationary states are 
certain selected meohanical orbits, and if so, which. Proceeding 
from example to example (oscillator, rotator, hydrogen atom), 
quantum conditions’ were found (Bohr, Wilson, Sommerfeld) 

which for every periodic coordinate q of the motion can be 
expressed in the form 


1 = 


= hn, 


( 8 . 27 ) 


^ere jp is the momentum corresponding to q and the integration 

. Pad ! d - 1116 most evincing theoretical 
Moment for 'choosing these integrals / was given by Ehrenfest, 

^o showed that if the system is subject to a slow external per- 

eauaS^ 7 A- “ “ Vanajlt and therefore well suited to be 
equated to a discontinuous ‘jumping’ quantity hn. 

an^T ng th6Se + <adiab r atic ^variants’ I there is in particular the 
gulan momentum of a rotating system and its component in a 
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given direction ; if both are to be integer multiples of A, the strange 
conclusion is obtained that an atom could not exist in all orienta- 
tions but only in a selected finite set. This was confirmed by 
Stem and Gerlach’s celebrated experiment (deflecting an 
atomic beam in an inhomogeneous magnetic field). I am proud 
that this work was done in my department in Frankfort-on- 
Main. There is hardly any other effect which demonstrates the 
deviations from classical mechanics in so striking a manner. 

A signpost for further progress was Bohr’s correspondence 
principle. It says that, though ordinary mechanics does not 
apply to atomic processes, we must expect that it holds at least 
approximately for large quantum numbers. This was not so 
much philosophy as common sense. Yet in the hands of Bohr 
and his school it yielded a rich harvest of results, beginning with 
the calculation of the constant R in the Balmer formula, f The 
mysterious laws of spectroscopy were reduced to a few general 
rules about the energy-levels and the transitions between them. 
The most important of these rules was Pauli’s exclusion prin- 
ciple, derived from a careful discussion of simple spectra; it says 
that two or more electrons are never in the same quantum state, 
described by fixed values of the quantum numbers (8.27) / 

belonging to all periods, including the electronic spin (TJhlenbeck 
and Goudsmit). With the help of these simple principles the 
periodic system of the elements could be explained in terms of 
electronic states. But all these great achievements of Bohr’s 
theory are outside the scope of our present interest. I have, 
however, to mention Bohr’s considerations about the correspon- 
dence between the amplitudes of the harmonic components of a 
mechanical orbit and the intensity of certain spectral linaa 
Consider an atom in the quantum state n with energy e n «.n^ 
suppose the orbit can be, for large n, approximately described 
by giving the coordinates q as functions of time. As these will 
be periodic, one can represent q as a harmonic (Fourier) series, 
of the type 

q(t) = f a m (n)coa[2-7Tv{n)(mt+8J], (8.28) 

m*>l 

t Soe Atomic Physics, Ch. V. 1, p. 98; A. XIV, p. 300, 
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6 f nd * mental *»> and the amplitude* 

^ Wdepend on the number » of the orbit considered. In reality 
the frequencies observed are not v(n), 2v(n), 3 *(»),... but * 

1 

v nin ^( e n e m)> 

**“ am P HtadM > B was dear that the squama 

feSL ^ 0,Jd r- 1 ; «r wa r to a. transition pmh. 

of Planet’*” a* “ troduced b y Einstein in his derivation 

fj!*?* 8 ** (8.16). But how could the mth ovur- 

hetwltrlt^r^ *» >"*» 

This was the central problem of quantum physics in the years 

hetween 1913 and 1926. In particular there aLLgrmttaW 

m^snnngin^ of specteallines, with the ££££ 
ZZ5L l e ^^ me “'■"’•Pkotosietera. Simple laws for the 
(OnST Sf «f multiplets were discovered 

so ZlTi-f . pr666,lts<1 “ qsadratic tables which look 

“ “ tol 40 understand why this 
assonabon of ideas did not happen in some brain. 

worktop l K °‘ ,me a the of ‘ he physioist was still 

ridTfa i, 1 ^ 1 taeS> “ d “ n “**i a special eSort to get 
T to Up tte id «* ° f a coordinate 

S' ^ ^0. tsPtoented by a Fourier series like 

S ilTs^f "* T™*™ “ *“■ fonnofa and to 

term, as representative of the 

amplitudes a In) hv e0WPe P°aaible to replace the Fourier 
eqidvsdent indict 7 q T™ ^PHtudes u(m,„) with two 

tm(») 5o*Wtj-w(»)->.c(m,») = |o(».,h)6(h,»). (8.29) 

^Z’ZlZT’"- KieOti0 “ oftnuiitionl concepts by a 

t See Atomic Physics, Ch. V. 3, p. 123; A. XV, p. 305. 
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eliminate them from the theory and introduce instead the 
quantum frequencies v nm = A -1 (e n — e m ), while the orbits should 
be completely abandoned. 

This suggestion of Heisenberg has been much admired as the 
root of the success of quantum mechanics. Attempts have been 
made to use it as a guide in overcoming the difficulties which 
have meanwhile turned up in physics (in the application of 
quantum methods to field theories and ultimate particles); yet 
with little success. Now quantum mechanics itself is not free 
from unobservable quantities. (The wave-function of SchrO- 
dinger, for instance, is not observable, only the square of its 
modulus.) To rid a theory of all traces of such redundant con- 
cepts would lead to unbearable clumsiness. I think, though 
there is much to be said for cleaning a theory in the way recom- 
mended by Heisenberg, the success depends entirely on soientifio 
experience, intuition, and tact. 

The essence of the new quantum mechanics is the representa- 
tion of physical quantities by matrices, i.e. by mathematical 
entities which can be added and multiplied according to well- 
known rules just like simple numbers, with the only difference 
that the product is non-cdmmutative. Tor instance, the quan- 
tum conditions (8.27) can be transcribed as the commutation law 

qp—pq = ih = (8.30) 

The Hamiltonian form of mechanics can be preserved by re- 
placing all quantities by the corresponding matrices. In par- 
ticular the determination of stationary states can be reduced 
to finding matrices q,p for which the Hamiltonian H(p, q) 
as a matrix has only diagonal elements which are then the energy- 
levels of the states. In order to obtain the connexion with 
Planck’s theory of radiation, the squares \q(m, n) | a have to be 
interpreted as Einstein’s coefficients B mn . In this way a few 
simple examples could be satisfactorily treated. But matrix 
mechanics applies obviously only to closed systems with discrete 
energy-levels, not to free particles and collision problems. 

This restriction was removed by SchrOdinger’s wave mechan- 
ics which sprang quite independently from an idea of de Broglie 
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■without recourse to 0 wJ^ d ™* nml laws b 7 Pm» reason, 
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time, whose other 00 ^ 0 ^ C ° m P 0nent of * ™ctor in space- 
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the fourth component °J * ° f & plane harmonic wave is also 
components represent the w Space ' tlme vector > whose other 

of the wave normal and theT I+if ^ ^ having tbe direction 

length). Now if Planck postuiSLtV-A 6 " 6 A * 

to assume also n = hk uu. , u * hv > one 18 compelled 
al-ady been done by Ei ESSr™*? ^ = * ^ had 
like darts with the m^LT °* ph ° t0nS beha ™ g 
applied it to electrons where the 1^ ° T ^ D ® BrogKe 
more complicated, namely obtS,S ^ ^ e and * is 

the velocity v: ^ rom ( 8 - 8 ) by elimi nating 

(c) =p8 + m o 2c *- (8.31) 

phase v ®lodty (rf^he^^ve^H^^^^^g ^ 

vA =v/k = e / p==c y v>C) 

apparently an impossible result +v, ‘ ^ 

excludes velocities larger thjf ihTt ° f relativit 7 

was not deterred by this- ^ f ^ But de Broglie 
velocities larger than c refers onlvto^ ^ probibitioa of 

used for sending time- signals Th ® u . cb . motlons which can be 
V ° ak ‘ That is impossible by means 
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of a monochromatic wave. For a signal one must have a small 
group of waves, the velocity of which can be obtained, according 
to Rayleigh, by differentiating frequency with respect to wave 
number. Thus, from (8.31) and (8.32), j* 


dv de pc 2 

dJc~ dp ~~ € 


(8.33) 


a most satisfactory result which completely justifies the formal 
connexion of particles and waves, though the physical meaning 
of this connexion was still mysterious. 

This reasoning is indeed a stroke of genius, yet not a triumph 
of a priori principles, but of an extraordinary capacity for com- 
bining and unifying remote subjects. 

I should say the same about the work of Schr 5 dinger and Dirac, 
but you could better ask them directly what they think about the 
roots of their discoveries. I shall not describe them here in detail, 
but indicate some threads to other facts or theories. SchrOdinger 
says that he was stimulated by a remark of de Broglie that any 
periodic motion of an electron must correspond to a whole 
number of waves of the corresponding wave motion. This led 
him to his wave equation whose eigenvalues are the energy- 
levels of stationary states. He was further guided by the analogy 
of mechanics and optics known from Hamilton’s investigations; 
the relation of wave mechanics to ordinary mechanics is the 
same as that of undulating optics to geometrical optics. Then, 
looking out for a connexion of wave mechanics with matrix 
mechanics, SchrOdinger recognized as the essential feature of a 
matrix that it represents a linear operator acting on a vector 
(one-column matrix), and came in this way to his operator cal- 
culus (see Appendix, 27) ; if a coordinate q is taken as an ordinary 
variable and the corresponding momentum as the operator 



the commutation law (8.30) becomes a trivial identity. Apply- 
ing the theory of sets of ortho-normal functions, he could then 
establish the exact relation between matrix and wave mechanics. 


f So© Atomic Physics, Ch. IV. 5, p. 84; A. XI, p. 295. 
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me wnoie QTiOTy nas been developed 
by Dirac from Heisenberg’s first idea by an independent and 
formally more general method based on the abstract concept of 
non-commuting quantities (g-numbers). 

The growth of quantum mechanics out of three independent 
roots uniting to a single trunk is strong evidence for the inevita- 
bility of its concepts in view of the experimental situation. 

Prom the standpoint of these lectures on cause and chance it 
is not the formalism of quantum mechanics but its interpretation 
whrch is of importance. Yet the formalism came first, and was 
well secured before it became clear what it really meant: nothing 
more or less than a complete turning away from the predomi- 
nance of cause (in the traditional sense, meaning essentially 
determinism) to the predominance of chance 

goes back to a tentative interpreta- 
nhotons TT f 11 °^ tbe existence of light waves and 
no ordinal f ^ WaV6S being a tghost field ’ w tich has 

^e nrebS-f y f 1 but wbose intens % determines 

appearance of photons. This idea could be 
transferred to the relation /„„j 


o ™ s waves. With the help of SchrOdin^er’s 

tirrr^tor“ t ^f^ dMiyob8ta!,e8 '‘ ho ' B[ “ ita - 

° f atom ‘ ™ der elytron bombardment, and other 

ztezr 18 “ ^h r 

“ o' «“> in a 

" *1- with ^ 
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QUANTUM MECHANICS 

In quantum mechanics physical quantities or observables are 
not represented by ordinary variables, but by symbols which 
have no numerical values but determine the possible values of 
the observable in a definite way to be described presently. These 
symbols can be added and multiplied with the proviso that 
multiplication is non-commutative: AB is in general different 
from BA. I cannot deal with the most general aspect of this 
symbolic calculus, but shall consider a special representation, 
namely that where the coordinates of the particles are 

regarded as ordinary numbers. Then a definite state of a system 
is defined by a function ifs(q l9 q 2 ,...), and an observable A can be 
represented by a linear operator: A^(q) means a new function 
<j>(q), the result of operating with A on «/r. If this result is, apart 
from a factor, identical with i/j, 

Ails = <4, (9.1) 

ijs is called an eigenfunction of A and the constant a an eigenvalue. 
The whole sot of eigenvalues is characteristic for the operator 
A and represents the possible numerical values of the observable, 
which may bo continuous or discontinuous. 

The coordinates q themselves can be considered to be opera- 
tors, namely multiplication operators: </ a operating on $ 
means multiplying 1 p by r/ rt . Operators whoso eigenvalues arc 
all real numbers arc called real (or ‘Hermitian’) operators. It 
is clear that all physical quantities have to be represented by real 
operators, as the eigenvalues ore supposed to represent the pos- 
sible results of measuring a physical quantity. One can easily 
see that not only the multiplication operators <j a but also the 
h 8 

momenta 7 )„ = — are real. But for the formal argument one 

* ^-/a 

can also use complex, operators, of the form C = (where 

i -z V— 1), and its conjugate <7* = A— iB; then CC* can be 
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shown to be a real operator with only positive (or zero) eigen- 
values. 

If two observables are represented by non-commuting opera- 
tors, A and B, their eigenfunctions are not all identical; if a is 
an eigenvalue of A belonging to such an eigenfunction, there is 
no state of the system for which a measurement can result in 

finding simultaneously for A and B sharp numerical values a 
and b. 

The theory cannot therefore in general predict definite 
values of all physical properties, but only probability laws. 
The same experiment, repeated under identical and controllable 
oon tions, may result in finding for a quantity A so many 
imes Oj, so many times a 2 , etc., and for B in the same way b x 

OT a- ??’, But the average of repeated measurements must be 
predictable. Whatever the rule for constructing the number 
w c represents the average A of the measurements of A, it 
mustjjy common sense, have the properties that 3+35 = AA-B 
and cA = cA, if c is any number. 

ap^Tfro^i! 10316 th6re f - U - WS 811 result. Consider 

V 0f operators A - * *Ko «><*• 

meansquaredenatioiis^orfche spreadings’ of the measurements, 

sb-J(B= 2 n (9.2) 

• nsT the'S’Tt Appendix, 28). which 


(9.3) 

where rj m 1 . . _ „ 

= (9.4) 

is the so-called Commutator’ nf 4 -™* 
specially auulied tnn j. wo °P e rators A , B. If this 

- « 2 m : rtsr* — ^ - » 

*-*>r (».<>) 

Heisenberg s celebrated uncertainty principle which is 
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a quantitative expression for the effect of non-commutation on 
measurements, but independent of the exact definition of aver- 
ages. It shows how a narrowing of the range for the measured 
g-values widens the range for p. The same holds, according to 
(9.3), for any two non-commuting observables with the differ- 
ence that the ‘uncertainty’ depends on the mean of the commu- 
tator. 

These general considerations are, so to speak, the kinematical 
part of quantum mechanics. Now we turn to the dynamical 
part. 

Just as in classical mechanics, the dynamical behaviour of a 
system of particles is described by a Hamiltonian 

B(q l9 q 29 ...;p v p 2> ...) 

which is a (differential) operator. It is usually just taken over 
from classical mechanics (where, if necessary, products like pq 
have to be ‘symmetrized 5 into Um+QP))* I 11 Dirac’s relativistic 
theory of the electron there are, apart from the space coordinates, 
observables representing the spin (and similar quantities in 
meson theory); they lead to no fundamental difficulty and will 
not be considered here. 

Yet one remark about the Hamiltonian H has to be made, 
bearing on our general theme of cause and chance: H contains 
in the potential energy (and in corresponding electromagnetic 
interaction terms) the last vestiges of Newton’s conception of 
force, or, using the traditional expression, of causation. We 
have to remember this point later. 

In classical mechanics we have used a formulation of the laws 
of motion which applies just as well to a simple system, where all 
details of the motion are of interest, as to a system of numerous 
particles, where only statistical results are desired (and possible). 
A ftmction/(J,p,< 7 ) of time and of all coordinates and momenta 
was considered; if p,q change with time according to the equa- 
tions of motion, the total change of / is given by 


— = ——\H f] 
dt 8t 1 ,Ji ’ 


(9.6) 
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•where [if,/] is the Poisson bracket 



83_df_ 
Hk 8 Pk 


8H jn 
dPkfyJ' 


(9.7) 


One recovers the canonical equations by taking for / 
simply q k or p k respectively. On the other hand, if one puts 
df/dt = 0, any solution of this equation is an integral of the 
equations of motion, and from a sufficient number of such 
integrals J k {t,p, q) = c k one can obtain the complete solution 
giving all p, q as functions of t. 

But if this is not required, the same equation is also the means 
for obtaining statistical information in terms of a solution f, 
called the ‘distribution function’, as I have described in detail. 
/ is that integral of 

f = (9.8) 


which for t = 0 goes over into a given initial distribution/,,^, q). 
If, in particular, this latter function vanishes except in the 
neighbourhood of a given point p g , q 0 in phase space , or, in Dirac ’ b 
notation, if / 0 = 8(p—p 0 )S(q—q 0 ), one falls back to the case of 
complete knowledge, q 0 and p 0 being the initial values of q and p. 

This procedure cannot be transferred without alteration to 
quantum mechanics for the simple reason that p and q cannot 
be simultaneously given fixed values. The uncertainty relation 
(9.5) forbids the prescribing of sharp initial values for all p and q. 
Hence the first part of the programme, namely a complete 
knowledge of the motionin the same sense as in classical mechan- 
ics, breaks down right from the beginning. Yet the second part, 
statistical prediction, remains possible. Following Dirac, we 
ask which quantities have to replace the Poisson brackets (9.7) 
m quantum theory, where all quantities are in general non- 
commuting. These brackets [a, /l] have a number of algebraic 
properties; the most important of them being 


[“> — [“>&]+ [a, j8 a ], 

[“,&&] = &[«.&]+[<*,&]&. <9 - 9) 

T th6Se 8ta11 h0ld also for non-commuting 
quantities « and ft provided the order of factors is always 
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reserved (as it is in (9.9)), then it can be shown (Appendix, 29) 
hat [a,/?] is exactly the commutator as defined by (9.4). 

Now one has to replace the function / in (9.8) by a time- 
Lependent operator p, called the statistical operator, and to 
Letermine p from the equation (formally identical with (9.8)): 

| = [#/>] (9.10) 

rith suitable initial conditions. To express these in a simple 
vay it is convenient to represent all operators by matrices in 
he g-space; A operating on a function i/j(q) is defined by 

At(q) = J A(q,q’Mq') dq' (9.11) 

9. stands for aU coordinates and q' for another set of 

rallies where A(q,q') is called the matrix represent- 

ing 

The product AB is represented by the matrix 

AB(q, q') = J A(q, q")B(q", q’) dq". (9.12) 

A now p and H are taken as such matrices, where the elements 
>f p depend also on time, (9.10) is a differential equation for 
an d the initial conditions are simply 


P(0,?,2') = A>(?>2'), (9-13) 

vhere p 0 is a given function of the two sets of variables. 

The number of vector arguments in p for a system of N par- 
doles is 2 N, exactly as in the case of classical theory in the func- 
tion f{p 9 q ). But while the meaning of / depending on p,q is 
obvious, that of p depending on two sets g, g' is not, except in 


>ne case, namely when the two sets are identical, g = g'; then 


}he function 


p{t 9 q,q) = n(t,q) 


(9.14) 


s the number density, corresponding to the classical 

jf(t,q,p)dp = n(t,q). 

Juite generally, the classical operation of integrating over the p ’ s 
.& replaced by the simpler operation of equating the two sets 
>f q’ s, q = g', or in matrix language, taking the diagonal elements 
xf p. 

5131 „ 
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The average of an observable A for a configuration q must be 
a real number A formed from p and A so that nA is linear in both 
operators. The simplest expression of this kind is 

nA = UpA+Ap)^, (9.15) 

and this gives, in fact, all results of quantum mechanics usually 
obtained with the help of the wave function. For instance, the 
statistical matrix describing a stationary state where A has a 
sharp value a, belonging to the eigenfunction q), is 

p = +(a,q)t*(a,q'). (9.16) 

Then, from the definition (9.12) it follows easily that for this p 
and any real operator A one has 

Ap = pA = ap, (9.17) 

hence for q = q', with (9.14), 

n(a,q) = |^(a,?)|®, 1 = a. (9.18) 

Thus we have obtained the usual assumption that \^(a, g) | 2 
is the ‘probability’ (if normalized to 1) or ‘number density* 
(if normalized to N) at the point q for the state a. (It must, 
however, be noted that for systems of numerous particles, like 
liquids in motion, other ways of averaging are useful, for instance 
for the square of a momentum instead of np% = \(pp i -\-p z p) a =q' 
the expression ^(pp ij rp 2 p j t-2ppp) ai _ l} ,, which, however, for uni- 
form conditions coincides with the former.) 

Let us consider the general stationary case where p is inde- 
pendent of time and therefore satisfies 

[2» p] = 0. (9.19) 

Any solution of this equation, i.e. any quantity A which com- 
mutes with H, is called an integral of the motion, in analogy to 
the corresponding classical conception. H itself is, of course, an 
integral. All integrals A 1} A 2 ,..., have different eigenvalues 
Al Aj,..., for one and the same eigenfunction ifi( A 2 ,...; q lt q 2 ,...), 
or shortly ^(A, q ) : 

A 1 ^ = A 1 ^-, A t ifi = X 2 if>, ... . (9.20) 

/> can be taken as any function of the A’s; its matrix representa- 
tion is given by 

p(s,?') = 2-P(Am?¥*(A,g'), 


( 9 . 21 ) 
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from which one obtains, with (9.18), 

»(«) = PM = 2 P(WM\* = 2 P(A)»(A,g). (9.22) 
A A 

This shows that the arbitrary coefficient P(\) is the probability 
of fin di n g the system in the stationary state A. 

Dynamical problems arise in a somewhat different way from 
those in classical theory. There it has a definite meaning to 
speak about the motion of particles in a closed system, for 
instance of the orbit of Jupiter in the planetary system. In 
quantum theory a closed system settles down in a d efinit e 
stationary state, or a mixture of suoh states as given by (9.21). 
But then nothing is changing in time; one cannot even make an 
observation without interfering with the state of the system. 
In classical physics it is supposed that we have to do with an 
objective and always observable situation; the process of measur- 
ing is assumed to have no influence on the object of observation. 
I have, however, drawn your attention to the point that even 
in classical physics this postulate is practically never fulfilled 
because of the Brownian motion which affects the instruments. 
W© are therefore quite prepared to find that the assumption of 
‘harmless’ observations is impossible. 

The most general way of formulating a dynamical problem is 
to split the Hamiltonian in two parts 

H = H 0 +V, (9.23) 
where Hq describes what is of interest while V is of minor impor- 
tance, a so-called perturbation, V may also include external 
influences and depend explicitly on the time. This partition is, 
of course, arbitrary to a high degree; but it corresponds to the 
actual situation. If a water molecule H 2 0 is assembled from its 
atoms, one can either ask what the stationary states of the whole 
system are, or one can consider the parts H 2 and O and ask how 
the states of the hydrogen molecule H 2 are changed by the 
approaching oxygen atom, or one can ask the same question 
for the HO radical and the H atom. The latter two are dynamical 
problems. 

Dynamical problems in quantum theory therefore, in contrast 
to those in classical theory, cannot be defined without a sub jective, 
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more or less arbitrary decision about what you are interej 
in. In other words, quantum mechanics does not describe 
objective state in an independent external world, but the an] 
of this world gained by considering it from a certain subjoc - 
standpoint, or with certain experimental means and arrar 
ments. This statement has produced much controversy, i 
though it is generally aocepted by the present generatior 
physicists it has been decidedly rejected by just those two r 
who have done more for the creation of quantum physios t] 
anybody else, Planck and Einstein. Yet, with all rospecl 
cannot agree with them. In fact, the assumption of absol 
observability which is the root of the classical concepts seomi 
me only to exist in imagination, as a, postulate which oannol 
satisfied in reality. 

Assuming the partition (9.23) one has to describe the syston 
terms of the integrals of motion A x , A a ,... of H 0 which are, h< 
ever, not integrals of motion of H. All operators are then to 
expressed as matrices in the eigenvalues A (A ls A 2 ,...) of A x , A a 
for instance, the statistical operator p by the matrix p(t; A, 
The diagonal elements of this matrix 


P(t;X) = P (t; A, A) (!). 

represent the probability of a state A at time t, and they go o 
for t 0 into the coefficients P(A) which appear in the expanw 
(9.21) and represent the initial probabilities. The funot 
p(t; A, A') can be determined from the differential equation (9. 
by a method of successive approximations. My collaborat 
Green, has even found an elegant formula representing the co 
plete solution. To a second approximation one find? 

P(*,A) = P(A)+Y J(A,A'){P(A')-P(A)}+...; (9.: 

A 

the coefficients are given by 




(9.: 


J F(<; A, A y-tmv-untdt 
I 0 

® n ! rg f ° f the ^Perturbed system in the state 
A that in the state A' (see Appendix, 30). 
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Now equation (9.25) has precisely the form of the laws of radio- 
active decay, or of a set of competing mono-molecular reactions. 
The matrix J(X, A') obviously represents the probability of a 
transition or jump from the state X to the state A'. This inter- 
pretation becomes still more evident if one assumes that the 
A- values are practically continuous, as would be the case if the 
system allowed particles to fly freely about (for instance in radio- 
activity one has to take account of the emitted a-partioles; in the 
theory of optical properties of an atom of the photons emitted 
and absorbed). If external influences are excluded, so that V 
does not depend on time, the integral (9.26) can be worked out 
with the result that J becomes proportional to the time 

J(A,A') =j(X,X')t, (9.27) 

where j(X,X') = ^\V(X,X')\*8(E-E'). (9.28) 

n 

The last factor S(E-E') says that j( A, A') differs from zero for 
two states A and A' only if their energy is equal. j'( A, A') is 
obviously the transition probability per unit time, precisely 
the quantity used in radioactivity. 

By applying the formula (9.25) to the case of the interaction 
of an atom with an electromagnetic field one obtains the formula 
(8.24) which was used by Einstein in his derivation of Planck’s 
radiation law. There are innumerable similar applications, such 
as the calculation of the effective cross-sections of various kinds 
of collision processes, which have provided ample confirmation 
of the formula (9.25). 

^DETERMINISTIC PHYSICS 

There is no doubt that the formalism of quantum mechanics 
and its statistical interpretation are extremely successful in 
ordering and predicting physical experiences. But can our 
desire of understanding, our wish to explain things, be satisfied 
by a theory which is frankly and shamelessly statistical and 
indeterministic ? Can we be content with accepting chance, not 
cause, as the supreme law of the physical world? 

To this last question I answer that not causality, properly 
understood, is eliminated, but only a traditional interpretation 
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Of It, consisting in its identification with determinism. I have 

c a a :;r 8 *° J 1 * 18 * ttese tw ° concepts are not identical. 

Causality m my definition is the postulate that one physical 
situation depends on the other, and causal research means the 
covery of such dependence. This is still true in quantum 
ysics, ough the objects of observation for which a depen- 
dalmed « a*™* they are the probabilities of 
T t ^ even ^ s » no * those single events themselves. 

matrix P > from ^ch these probabihties 
- . , V ’ satlsded a differential equation which is essentially 
1 , 6 same iyP e as the classical field equations for elastic or 

WaY6S - ' For Stance, if one multiplies the 
g nfunction if,(q) 0 f the Hamiltonian H, Hdi = JEJt by e**** 

the new function satisfies V 




(9.29) 


For a free particle, where B = l-( p l +p * +pl) = _^ A>(9 . 29) 
goes over into the wave equation 


2midJ> 

-Tst=^- 


(9.30) 


funea^if 1 ? 6 Sj &St deriva tive with respect to time 
ppears, it does not differ essentially from the ordinary wave 

equation ^where the left-hand side is ^ j . One must remember 

where^* ttt' 7 ^ & physical meanin g (as a probability), 

where ** satisfies the conjugate complex equation 

2mi8< 4* 

^aispM- of equations nohimge in the time direction ((-* _() 

♦ - *>. which his no to! 
“ft* we see that the 
^ S " al fi6ld *h»‘ principle of antSJLi 
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is violated: there is no distinction between past and future for 
the spreading of the probability density. On the other hand, the 
principle of contiguity is obviously satisfied. 

The differential equation itself is constructed in a way very 
similar to the classical equations of motion. It contains in the 
potential energy, which is part of the Hamiltonian, the classical 
idea of force, or in other words, the Newtonian quantitative 
expression for causation. If, for instance, particles are acting 
on one another with a Coulomb force (as the nucleus and the 
electrons in an atom), there appears in H the same timeless action 
over finite distance as in Newtonian mechanics. Yet one has the 
feeling that these vestiges of classical causality are provisional 
and will be replaced in a future theory by something more satis- 
factory ; in fact, the difficulties which the application of quantum 
mechanics to elementary particles encounters are connected with 
the interaction terms in the Hamiltonian; they are obviously still 
too ‘classical’. But these questions are outside the scope of my 
lectures. 

We have the paradoxical situation that observable events obey 
laws of chance, but that the probability for these events itself 
spreads according to laws which are in all essential features 
causal laws. 

Here the question of reality cannot be avoided. What really 
are those particles which, as it is often said, can just as well 
appear as waves ? It would lead me far from my subject to discuss 
this very difficult problem. I think that the concept of reality is 
too much connected with emotions to allow a generally accept- 
able definition. For most people the real things are those things 
which are important for them. The reality of an artist or a poet 
is not comparable with that of a saint or prophet, nor with that 
of a business man or administrator, nor with that of the natural 
philosopher or scientist. So let me cling to the latter kind of 
special reality, which can be described in fairly precise terms. 
It presupposes that our sense impressions are not a permanent 
hallucination, but the indications of, or signals from, an external 
world which exists independently of us. Although these signals 
change and move in a most bewildering way, we are aware of 
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Objects with invariant properties. The set of these invariants of 
our sense impressions is the physical reality which our mind 
constructs in a perfectly unconscious way. This chair here looks 
different with each movement of my head, each twinkle of my 
eye, yet I perceive it as the same chair. Science is nothing else 
than the endeavour to construct these invariants where they are 
not obvious. If you are not a trained scientist and look through 
a microscope you see nothing other than specks of light and 
colour, not objects; you have to apply the technique of biological 
science, consisting in altering conditions, observing correlations, 
etc., to learn that what you see is a tissue with cancer cells, or 
something like that. The words denoting things are applied to 
permanent features of observation or observational invariants. 

In physics this method has been made precise by using mathe- 
matics. There the invariant against transformation is an exact 
•^ e ^ n i* 1 his celebrated Erlanger Programm has 
classified the whole of mathematics according to this idea, and 
the same could be done for physics. 

Ikom this standpoint I maintain that the particles are real, 
as they represent invariants of observation. We believe in the 
existence’ of the electron because it has a definite charge e and a 
efimte mass m and a definite spin s; that means in whatever 
circumstances and experimental conditions you observe an effect 
which theory ascribes to the presence of electrons you find for 
th^ quantities, e, m, s, the same numerical values. 

Whether you can now, on account of these results, imagine 
the electronlike a tiny grain of sand, having a definite position in 

^ 0th6T matter * ** fact you can, even in quantum 
eory. What you cannot do is to suppose it also to have a 
definite velocity at the same time ; that is impossible according to 
the uncertainty relation. Though in our everyday experience 
we can ascribe to ordinary bodies definite positions and velocities, 

SrrrT* afl8Umethe ** dimensions which are 
below the limits of everyday experience. 

Ration and velocity are not invariants of observation. But 

themZo v. the id6a 0f a P article > and we must use 

them as soon as we have made up our minds to describe certain 
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phenomena in terms of particles. Bohr has stressed the point that 
our language is adapted to our intuitional concepts. We cannot 
avoid using these even where they fail to have all the properties 
of ordinary experience. Though an electron does not behave like 
a grain of sand in every respect, it has enough invariant pro- 
perties to be regarded as just as real. 

The fact expressed by the uncertainty relation was first dis- 
covered by interpreting the formalism of the theory. An 
explanation appealing to intuition was given afterwards, namely 
that the laws of nature themselves prohibit the measurement 
with infini te accuracy because of the atomic structure of matter: 
the most delicate instruments of observation are atoms or 
photons or electrons, hence of the same order of magnitude as 
the objects observed. Niels Bohr has applied this idea with great 
success to illustrate the restrictions on simultaneous measure- 
ments of quantities subject to an uncertainty rule, which he calls 
‘ complementary 5 quantities. 

One can describe one and the same experimental situation 
about particles either in terms of accurate positions or in terms 
of accurate momenta, but not both at the same time. The two 
descriptions are complementary for a complete intuitive under- 
standing. You find these things explained in many text-books 
so that I need not dwell upon them. 

The adjective complementary is sometimes also applied to 
the particle aspect and the wave aspect of phenomena — I t hink 
quite wrongly. One can call these ‘dual aspects’ and speak of a 
‘duality’ of description, but there is nothing complementary as 
both pictures are necessary for every real quantum phenomenon. 
Only in limiting cases is an interpretation using particles alone 
or waves alone possible. The particle case is that of classical 
mechanics and is applicable only to the case of large masses, 
e.g. to the centre of mass of an almost closed system. The wave 
case is that of very large numbers of independent particles, as 
illustrated by ordinary optics. 

The question of whether the waves are something ‘real’ or a 
fiction to describe and predict phenomena in a convenient way 
is a matter of taste. I personally like to regard a probability 
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The intensity, i.e. the probability of finding photons on the 
screen, in the case of both slits open, is therefore not a simple 
superposition of those obtained when only one of the slits is 
open. This is at once understandable if you use the picture of 
probability waves determining the appearance of photons. For 
the spreading of the waves depends on the whole arrangement, 
and there is no miracle in the effect of shutting one slit. Yet if 
you try to use the particles alone you get into trouble; for then a 
particle must have passed one slit or the other and it is perfectly 
mysterious how a slit at a finite distance can have an influence on 
the diffraction pattern. Reichenbach, who has published a very 
thorough book on the philosophical foundations of quantum 
mechanics, speaks in such cases of "causal anomalies’. To avoid 
the perplexity produced by them he distinguishes between 
phenomena, i.e. things really observable, such as the appearance 
of the photons on the screen, and ‘inter-phenomena’ , i.e. theoreti- 
cal constructions about what has happened to a photon on its 
way, whether it has passed through one slit or the other. He 
states rightly that the difficulties arise only from discussing 
inter-phenomena. ‘That a photon has passed through the slit 
B 1 is meaningless as a statement of a physical fact.’ If we want 
to make it a physical fact we have to change the arrangement 
in such a way that the passing of a photon through the slit B x can 
be really registered; but then it would not fly on undisturbed, 
and the phenomenon on the screen would be changed. Reichen- 
bach’s whole book is devoted to the discussion of this type of 
difficulty. I agree with many of his discussions, though I object 
to others. For instance, he treats the interference phenomenon 
of two slits also in what he calls the wave interpretation; but here 
he seems to me to have misunderstood the optical question. In 
order to formulate the permitted and forbidden (or meaningless) 
statements he suggests the use of a three-valued logic, where the 
law of the ‘excluded middle’ ( tertium non datur) does not hold. 
I have the feeling that this goes too far. The problem is not one 
of logic or logistic but of common sense. For the mathematical 
theory, which is perfectly capable of accounting for the actual 
observations, makes use only of ordinary two-valued logics. 
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Difficulties arise solely if one transcends actual observations and 
insists on nging a special restricted range of intuitive images 
n-nfl corresponding terms. Most physicists prefer to adapt their 
imfl.ginfl.tinn to the observations. Concerning the logical pro- 
blem itself, I had the impression when reading Reichenbach’s 
book that in explaining three-valued logic he constantly used 
ordinary logic. This may be avoidable or justifiable. I remember 
the days when I was in daily contact with Hilbert, who was 
working on the logical foundations of mathematics. Ho dis- 
tinguished two stages of logics: intuitive logic dealing with finite 
sets of statements, and formal logic (logistics), which he described 
as a game with meaningless symbols invented to deal with the 
infinite sets of mathematics avoiding contradictions (like that 
revealed in Russell’s paradox). But Gtidel showed that these 
contradictions crop up again, and Halbert’s attempt is to-day 
generally considered a failure. I presume that three-valued 
logic is another example of such a game with symbols. It is 
certainly entertaining, but I doubt that natural philosophy will 
gain much by playing it. 


T h inking in terms of quantum theory needs some effort and 
considerable practice. The clue is the point which I have 
stressed above, that quantum mechanics does not describe a 
situation in an objective external world, but a definite experimen- 
tal arrangement for observing a section of the external world. 
Without this idea even the formulation of a dynamical problem 
m quantum theory is impossible. But if it is accepted, the funda- 
mental indeterminacy in physical predictions becomes natural, 

as no experimental arrangement can ever be absolutely precise. 

,,, ^ even most fervent determinist cannot deny 
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seaming the history of physics in the way we have done we see 
fluctuations and vacillations, but hardly a reversion to more 
primitive conceptions. I expect that our present theory will be 
profoundly modified. For it is full of difficulties which I have not 
mentioned at all — the self-energies of particles in interaction 
and many other quantities, like collision cross-sections, lead to 
divergent integrals. But I should never expect that these 
difficulties could be solved by a return to classical concepts. I 
expect just the opposite, that we shall have to sacrifice some 
current ideas and use still more abstract methods. However, 
these are only opinions. A more concrete contribution to this 
question has been made by J. v. Neumann in his brilliant book; 
Mathematische Orundlagen der Quantenmechanik . He puts the 
theory on an axiomatic basis by deriving it from a few postulates 
of a very plausible and general character, about the properties 
of ‘expectation values’ (averages) and their representation by 
mathematical symbols. The result is that the formalism of 
quantum mechanics is uniquely determined by these axioms; in 
particular, no concealed parameters can be introduced with the 
help of which the indeterministic description could be trans- 
formed into a deterministic one. Hence if a future theory 
should be deterministic, it cannot be a modification of the present 
one but must be essentially different. How this should be 
possible without sacrificing a whole treasure of well-established 
results I leave to the determinists to worry about. 

I for my part do not believe in the possibility of such a turn 
of things. Though I am very much aware of the shortcomings 
of quantum mechanics, I think that its indeterministic founda- 
tions will be permanent, and this is what interests us from the 
standpoint of these lectures on cause and chance. There remains 
now only to show how the ordinary, apparently deterministic 
laws of physics can be obtained from these foundations. 

QUANTUM KINETIC THEORY OF MATTER 

The main problem of the classical kinetic theory of matter 
was how to reconcile the reversibility of the mechanical motion 
of the ultimate particles with the irreversibility of the thermo- 



110 


CHANCE 


dynamical laws of matter in bulk. This was achieved by pro- 
claiming a distinction between the true laws which are strictly 
deterministic and reversible but of no use for us poor mortals 
with our restricted means of observation and experimentation, 
and the apparent laws which are the result of our ignorance and 
obtained by a deliberate act of averaging, a kind of fraud or 
falsification from the rigorous standpoint of determinism. 

Quantum theory can appear with a cleaner conscience. It 
has no deterministic bias and is statistical throughout. It has 
accepted partial ignoranoe already on a lower level and need 
not doctor the final laws. 

In order to define a dynamical phenomenon one has, as we have 
seen, to split the system in two parts, one being the interesting 
one, the other a ‘perturbation’; and this separation is highly 
arbitrary and adaptable to the experimental arrangement to be 
described. Now this circumstance can be exploited for the pro- 
blem of thermodynamics. There one considers two (or more) 
bodies first separated and in equilibrium, then brought into 
contact and left to themselves until eq uili brium is again, 
attained. 

Let E W be the Hamiltonian of the first body, H®> that of the 
second, and write 

So = HW+H®. (0.31) 


Then this is the combined Hamiltonian of the separated bodies. 

H they are brought into contact the Hamiltonian will be differ- 
ent, namely 

H = H 0 +V, (9.32) 


I*®” 7 interaction, which for ordinary matter in bulk will 

of surface forces. Now (9.32) has exactly the form of the 
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by H 0 . Hence we can use the solution of the dynamical problem 
given before, namely (9.25), 

P(t, A) = P(A)+ £ J-( A, A'){P(A')-P(A)}+..„ (9.33) 

where now A represents the sets of eigenvalues A (1) = (Af>, A^ x) ,...) 

of Af>, AfJ,..., and A® = (A^A^,...) of A£ a >, 

Let us consider first statistical equilibrium. Then 

P(t, A) = P(A); 

hence the sum must vanish, and one must have 

P(A') = P(A) (9.34) 

for any two states A, A' for which the transition probability 
J( A, A') is not zero. But we have seen further that these quan- 
tities «7( A, A') are in all practical oases proportional to the time 
and vanish unless the energy is conserved, E = E' (see formulae 
9.27, 9.28). If we disregard cases where other constants of motion 
exist for which a conservation law holds (like angular momentum 
for systems free to rotate), one can replace P(A) by P(E). But 
as the total system consists of two parts which are practically 
independent, one has 

P(E) = P(A) = P 1 (A< 1 ))P 2 (A< 2 )), (9.36) 

where the two factors represent the probabilities of finding the 
separated parts initially in the states A (1) and A (2) . This factoriza- 
tion need not be taken from the axioms of the calculus of pro- 
bability; it is a consequence of quantum mechanics itself. For 
if the energy is a sum of the form (9.31), the exact solution of the 
fundamental equation for the density operator 

| = (9.36) 

is p = Pi p 2 , where p t refers to the first system 27 (1) , p. 2 to the 
second as according to (9.24) P(t, A) = p(t\ A, A), the product 
formula (9.35) holds not only for the stationary case (as long as 
the interactions can be neglected). If now P (1) (A <U ) and P (2) (A (2) ) 
are the energies of the separated parts, one obtains from (9.35) 

P(P(i)+P<a>) = P 1 (A<«)P a (A®), (9.37) 
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which, is a functional equation for the three functions P, -Fj.,Pa* 
The solution is easily found to be (see Appendix, 31) 

p = p x = epu-PBi, p 2 _ e ott-PE* t (9.38) 

with a = ai+aa, = P 1 +P 2 (9.39) 


and the same p in all three expressions. 

Thus we have found again the canonical distribution of Gibbs, 
with the modification that the energies appearing are not explicit 
functions of q and p (Hamiltonians) but of the eigenvalues 
A, A®, A® of the integrals of motion. 

THr derivation is obviously a direct descendant of Maxwell’s 
first proof of his velocity distribution law which we discussod 
previously, p. 51. But while the argument of independence is not 
justifiable with regard to the three components of velocity, it is 
perfectly legitimate for the constants of motion A. The fact 
that the multiplication law of probabilities and the additivity 
of energies for independent systems leads to the exponential 
distribution law has, of course, been noticed and used by many 
authors, beginning with Gibbs himself. This reasoning becomes, 
with the help of quantum mechanics, an exact proof which shows 
the limits of validity of the results. For if there exist constants 
of motion other than the energy, the distribution law has to 
be modified, and therefore the whole of thermodynamics. This 
happens for instance for bodies moving freely in space, like stars, 
where the quantity /? = 1 JkT is no longer a scalar but the time 
component of a relativistic four-vector, the other components 
representing 0v, where v is the mean velocity of the body. Yet 
this is outside the scope of these lectures. 


The simplest and much discussed application of quantum 
tistics is that to the ideal gases. It was Einstein who fix*st 
noticed that for very low temperatures deviations from the 
dassical laws should appear. The Indian physicist, Boso, had 
shown that one can obtain Planck’s law of radiation by rogard- 
g the radiation as a ‘photon gas’ provided one did not treat 
nletdt ^cognizable particles but as 00 m- 

mlteLa^ able ‘ EinStein tra ^rred this idea to 

ms. Later it was recognized that this so-called Boso- 
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Einstein statistics was a straightforward consequence of quan- 
tum mechanics; about the same time Fermi and Dirac discovered 
another similar case which applies to electrons and other particles 
with spin. 

In the language used here the two ‘statistics’ can be simply 
characterized by the symmetry of the density function 

P(^l» ^1> ^-n)‘ 

It is always symmetric, for indistinguishable particles, in both 
sets of arguments, i.e. it remains unchanged if both sets are 
subject to the same permutation. If, however, only one set is 
permuted, p remains also unchanged in the Bose-Einstein case 
for all permutations, while in the Fermi-Dirac case it does so 
only for even permutations, and changes sign for odd permuta- 
tions. 

Applied to a system of free particles of equal structure, one 
obtains at once from the canonical distribution law the pro- 
perties of so-called degenerate gases. But as these are treated in 
many text-books, I shall not discuss them here (see Appendix, 
32 ). 

After having considered statistical equilibrium we have now 
to ask whether quantum mechanics accounts for the fact that 
every system approaches equilibrium in time by the dissipation 
of visible energy into heat, or, in other words, whether the 
if -theorem of Boltzmann holds. 

This is the case indeed, and not difficult to prove. One defines 
the total entropy, just as in classical theory, by 

ZP(t, A)log P(t, A) 

— (9 ' 40> 

where the summations are to be extended over all values of the 
AjjAj,...; i.e. for each separate part of a coupled system over 

A^, Ajj 1 !,... and A^ 2) , A| 2) respectively, and for the whole system 

over both sets. For loosely coupled systems the probabilities 
are, as we have seen, multiplicative at any time: 

P(t; A<V< 2 >) = 

i 


5131 


( 9 . 41 ) 
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From this it follows easily that the entropies are additive, 

S^Sj.+Sz. (9.42) 

Now substitute into (9.40) the explicit expression for P(f,A) 
from (9.33) which holds for weak coupling; then by neglecting 
higher powers of the small quantities J( A, A') one obtains 

h 2/(A,A')<2(A,A') 

S = 8 o+2 ’ ^ P(A) ’ (9,43 ^ 

where Q( A, A') = {P(A)-P(A')}logi^. (9.44) 

The transition probabilities J (A, A') are, as we have seen, in all 

practical cases proportional to time and vanish for transitions, 

for which energy is not conserved; one has, according to (9.27) 

and (9.28), „ 

J(A,A') = <^|F(A,A')| 2 8(P-P'), (9.45) 

11 

where V is the interaction potential. These quantities J(A, A') 
are always positive. So is the denominator 2 P(A), while Q( A, A') 

is positive as long as P( A) differs from P(A'). 

Hence 8 increases with time and will continue to do so, until 
statistical equilibrium is reached. For only then no further 
increase of 8 will happen, as is seen by taking equilibrium as the 
initial state (where according to (9.34) Q( A, A') = 0 for all non- 
vanishing transitions). 

It remains now to investigate whether quantum kinetics 
leads, for matter in bulk, to the ordinary laws of motion and ther- 
mal conduction as formulated by Cauchy. This is indeed the 
case, as far as these laws are expressed in terms of stress, energy, 
and flux of matter and heat. Yet, as we have seen, this is only 
half the story, since Cauchy’s equations are rather void of moan- 
ing as long as the dependence of these quantities on strain, 
temperature, and the rate of their changes in space and time are 
not given. Now in these latter relations the difference between 
quantum theory and classical theory appears and can reach vast 
proportions under favourable circumstances, chiefly for low 
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temperatures. The theory sketched in the following is mainly 
due to my collaborator, Green. 

The formal method of obtaining the hydrothermal equations 
is very similar to that used in classical theory. Starting from the 
fundamental equation for N particles 

8 ~§ = [Z N>PN ], (9.46) 


a reduction process is applied to obtain s imil ar equations for 
N— 1, N— 2,..., particles, until the laws of motion for one particle 
are reached. 

The reduction consists, as in classical theory, in averaging 
over one, say the last, particle of a set. The coordinates of each 
particle appear twice as arguments of a matrix 


• Pn = p»( x(1) > X®,..., x«; X<«', X®',..., xW')- 


Put here x (re) = x (tt >' and integrate over x (n) ; the result is x n Pn > a 
matrix which depends only on x (1) ,..., x (n-1) ; x (1 >',..., x (w_1 >'. 
With the same normalization as in classical theory, (6.40), p. 67, 

we write ,, T , 

Xa+iPa+i = ( N ~ ?)/>«• ( 9 -47) 


By applying this operation several times to (9.46) one obtains 
(see Appendix, 33) 


^ = [H q ,p Q ]+8 q (q = 1 , 2 ,..., N), (9.48) 


where 8 a =£ x fl+1 p a+1 ], (9.49) 

in full analogy with the corresponding classical equations (6.44), 
(6.46). Here H q means the Hamiltonian of q particles, 
the interaction between one of these ( i ) and a further particle 
(g+1), and$ ff is called, as before, the statistical term. 

The quantity /> a (x, x) = n q (x) represents the generalized 
number density for a ‘cluster’ of q particles, and in particular 
w^x) is the ordinary number density. 

Now one can obtain generalized hydro-thermodynamical 
equations from (9.47) by a similar process to that employed in 
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classical theory. Instead of integrating over the velocities one 
has to take the diagonal terms of the matrices (putting x = x'), 
and one has to take some precautions in regard to non-commuta- 
tivity by symmetrizing products, e.g. replacing a/3 by £(a/?-f/Ja) 
(see Appendix, 33). Exactly as in the classical equations of 
motion there appears the average kinetic energy of the particle 
(i) in a cluster of q particles which, divided by £Jb, may be called 
a kinetic temperature of the particle (£) in the cluster of q par- 
ticles. One might expect that the quantity T x corresponds to the 
ordinary temperature; but this is not the case. 

It is well known from simple examples (e.g. the harmonic 
oscillator) that in quantum theory for statistical equilibrium the 
thermodynamic temperature T, defined as the integrating de- 
nominator of entropy, is not equal to the mean square momen- 
tum. Here in the case of non-equilibrium it turns out that not 
only this happens, but that a similar deviation occurs with regard 
to pressure. The thermodynamical pressure p is defined as the 
work done by compression for unit change of volume; the kin etic 
pressure j)jis the isotropic part of the stress tensorin the equations 
of motion. These two quantities differ in quantum theory. 

Observable effects produced by this difference occur only for 
extremely low temperatures. Eor gases these are so low that 
they cannot be reached at all because condensation takes place 
long before. Most substances are solid crystals in this region of 
temperature; for these one has a relatively simple quantum 
theory, initiated by Einstein, where the vibrating lattice is 
regarded as equivalent to a set of oscillators (the ‘normal modes’) . 
This theory represents the quantum effects in equilibrium 
(specific heat, thermal expansion) fairly well down to zero 
temperature, while the phenomena of flow are practically 
unobservable. 

There are only two cases where quantum phenomena of flow 
at very low temperatures are conspicuous. One is liquid helium 
which, owing to its small mass and weak cohesion, does not 
crystallize under normal pressure even for the lowest tempera- 
tures and beoomes supra-fluid at about 2° absolute. The other 
case is that of the electrons in metals which, though not an 
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ordinary fluid, behave in many respects like one and, owing to 
their tiny mass, exhibit quantum properties, the strangest of 
which is supra-conductivity. 

In order to confirm the principles of quantum statistics, 
investigations of these two cases are of great interest. Both 
have been studied theoretically in my department in Edinburgh, 
and I wish to say a few words about our results. 

In the supra-fluid state helium behaves very differently from a 
normal liquid. It appears to lose its viscosity almost completely; 
it flows through capillaries or narrow slits with a fixed velocity 
almost independent of the pressure, creeps along the walls of the 
container, and so on. A metal in the supra-conductive state has, as 
the name says, no measurable electrical resistance and behaves 
abnormally in other ways. A common and very conspicuous 
feature of both phenomena is the sharpness of the transition 
point which is accompanied by an anomaly of the specific 
heat: it rises steeply if the temperature approaches the critical 
value T c from below, and drops suddenly for T = T 0 , so that the 
graph looks like the Greek letter A; hence the expression A-point 
for T c . However, this similarity cannot be very deeply rooted. 
Where has one to expect, from the theoretical standpoint, the 
beginning of quantum phenomena ? Evidently when the momen- 
tum p of the particles and some characteristic length l are reaching 
the limit stated by the uncertainty principle, pi ~ ft. If we 
equate the kinetic energy p 2 /2m to the thermal energy kT, 
the critical temperature will be given by lcT c • — - ft 2 /2ml 2 . If one 
substitutes here for k and ft the well-known numerical values and 
for m the mass of a hydrogen atom times the atomic mass 
number /x, one finds, in degrees absolute, 

OO 

Tc~Jz, (9.50) 

where l is measured in Angstrom units (10~ 8 cm.). 

For a helium atom one has /z = 4, and if l is the mean distance 
of two atoms (order 1 A) one obtains for T e a few degrees, which 
agrees with the observed transition at about 2°. But for elec- 
trons in metals one has /z = 1/1840 . If one now assumes one 
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electron per atom and interprets l as their mean distance it would 
be again of the order 1 A, hence the expression (9.50) would 
become some thousand degrees and has therefore nothing to do 
with the A-point of supra-conductivity. This temperature has, 
in fact, another meaning; it is the so-called ‘degeneration tem- 
perature 5 T g of the electronic fluid; below T g> for instance at 
ordinary temperatures, there are already strong deviations from 
classical behaviour (e.g. the extremely small contribution of the 
electrons to the specific heat), though not of the extreme charac- 
ter of supra-conductivity. In order to explain the A-point of 
supra-conductivity which lies for all metals at a few degrees 
absolute, one has to take l about 200 times larger (~ 200 A). 
As the interpretation of this length is still controversial, I shall 
not discuss supra-conductivity any further (see Appendix, 34). 

Nor do I intend in the case of supra-fluidity of helium to give 
a full explanation of the A-discontinuity, but I wish to direct 
your attention to the thermo-mechanical properties of the 
supra-liquid below the A-point, called He II. 

I have already mentioned that in quantum liquids one has to 
distinguish the ordinary thermodynamic temperature T and 
pressure p from the kinetic temperature T x and pressure p v 
The hydro-thermal equations contain only T x and p l9 and these 
quantities are constant in equilibrium, i.e. for a state where no 
change in time takes place. But T x and p x are not simple 
functions of T and p but depend also on the velocity and its 
gradient. Therefore in such a state permanent currents of mass 
and of energymay flow asifno viscosityexisted. This is reflected in 
the energy balance which can be derived from the hydro-thermal 
equations. One obtains a curious result which looks like a viola- 
tion of the first law of thermodynamics; for the change of heat 
is given by 

dQ = TdS = dU+pdV-Vdm , (9.51) 

where all symbols have the usual meaning, and tt = p ± — p is 
the difference of the kinetic and thermodynamic pressures. This 
equation differs from the ordinary thermodynamical expression 
(5.12) by the term — V d7r;howis this possible if thermodynamics 
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claims rightly universal validity ? This claim is quite legitimate, 
but the usual form of the expression for dQ depends on the 
assumption that a quasi-static, i.e. very slow, process can be 
regarded as a sequence of equilibria each determined by the 
instantaneous values of pressure and volume. This is correct in 
the classical domain, because if the rate of change of external 
action (compression, heat supply, etc.) is slowed down, all veloci- 
ties in the fluid tend to disappear. Not so in quantum mechanics. 
In consequence of the indeterminacy condition the momenta or 
the velocities cannot decrease indefinitely if the coordinates of 
the particles are restricted to very small regions. An investiga- 
tion of the hydro-thermal equations shows that this effect is 
preserved, to some degree, even for the visible velocities; it is 
true there can exist a genuine statistical equilibrium where the 
density is uniform and the currents of mass and energy vanish, 
but there are also those states possible where certain combina- 
tions of currents of mass (velocities) and of energy (heat) per- 
manently exist. The production of these depends entirely on 
the way in which the heat dQ is supplied to the system and cannot 
be suppressed by just making the rate of change of volume very 
small. We have therefore not a breakdown of the law of con- 
servation of energy but of its traditional thermodynamical 
formulation. 

The consequences of that extra term in (9.51) are easily seen 
by introducing instead of the internal energy the quantity 

E — U — irV (9.52) 

in the expression (9.51) for dQ, which then reads 

dQ = dE+pydV, (9.53) 

where p t = p+n is the kinetic pressure. This shows that the 
specific heat at constant volume is 



not ( dU/dT) v , as in classical thermodynamics. Now as p v and 
therefore v — p x —p, is very large at T = 0 and decreases with 
increasing T to reach the value 0 at the A-point, one obtains for 
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c v (T) a curve exactly of the form actually observed. Hence the 
A-anomaly is due to the coupling of heat currents with the 
mass motion characteristic of quantum liquids. It is a molar, 
macroscopic motion, the shape of which depends on the geo- 
metrical conditions, presumably consisting of tiny closed threads 
of fast-moving liquid, or groups of density waves. 

A simila r conception has been derived by several authors 
(Tisza, Mendelssohn, Landau) from the experiments; they speak 
of the liquid being a mixture of ordinary atoms and special 
degenerate atoms (z-particles) which are in the lowest quantum 
state and carry neither energy nor entropy. Yet in a liquid one 
cannot attribute a quantum state to single atoms. 

These considerations are also the clue to the understanding 
of other anomalous phenomena, as the flow through narrow 
capillaries or slits, the so-called fountain effect, the ‘second 
sound’, etc. Green has studied the properties of He II in detail 
and arrived at the conclusion that the quantum theory of liquids 
can account for the strange behaviour of this substance. 

I have dwelt on this special problem in some detail as it reveals 
in a striking way that quantum phenomena are not confined to 
atomic physics or microphysics where one aims at observing 
single particles, but appear also in molar physics which deals 
with matter in bulk. From the fundamental standpoint this 
distinction, so essential in classical physics, loses much of its 
meaning in quantum theory. The ultimate laws are statistical, 
and the deterministic form of the molar equations holds for 
certain averages which for large numbers of particles or quanta 
are all one wants to know. 

Now these molar laws satisfy all postulates of classical 
causality: they are deterministic and conform to the principles 
of contiguity and antecedence. 

With this statement the circle of our considerations about 
cause and chance in physics is closed. We have seen how classical 
physics struggled in vain to reconcile growing quantitative 
observations with preconceived ideas on causality, derived from 
everyday experience but raised to the level of metaphysical 
postulates, and how it fought a losing battle against the intrusion 
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of chance. To-day the order of ideas has been reversed: chance 
has become the primary notion, mechanics an expression of its 
quantitative laws, and the overwhelming evidence of causality 
with all its attributes in the realm of ordinary experience is 
satisfactorily explained by the statistical laws of large numbers. 
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The statistical interpretation which I have presented in the last 
seofaon is now generally accepted by physicists all over the world, 
with a few exceptions, amongst them a most remarkable one. 
As I have mentioned before, Einstein does not accept it, but still 
believes m and works on a return to a deterministic theory. , To 
illustrate his opinion, let me quote passages from two letters. 
The first is dated 7 November 1944, and contains these lines: 

In imserOT wissenschaftlichen Erwartung haben wir uns zu Antipoden 
entwickelt. Du glaubst an den wiirfelnden Gott und ich an voile Gesetz- 
hohkeit in einer Welt von etwas objektiv Seiendem, das ich auf wild 
spekuJativemWeg zu erhaschen suohe. Ich hoffe, dasa einer einen mehr 
reahstaschen Weg, bezw. eine mehr greifbare Unterlage fur eine solche 
Au^ssung finden wird, als esmir gegeben ist. Der grosse anfangliche 
Erfolg der Quantentheone kann mioh doch nicht zum Glauben an das 

fl/m fin tftl n Wlirfolorvinl > 


(la our scientific expectations we have progressed towards antipodes. 

m ^ h ® di ?. e ' playiDg god> ^ 1 “ the P erfect rule of law in a 
world of something objectively existing which I try to catch in a wildly 
speculative way. I hope that somebody will find a more realistic way, 
or a more tangible foundation for such a conception than that which is 
&ven to me. The peat initial success of quantum theory cannot convert 
me to beheve in that fundamental game of dice.) 

The second letter, which arrived just when I was writing these 
pages (dated 3 December 1947), contains this passage: 

‘Meine physikalische Haltung kann ich Dir nicht so begriinden, dass 
sieirgendwie vernunftig finden wiirdest. Ich sehe natSi ein, dass 
pmcipieU statisteihe Behandiungsweise, deren Notwendigkeit im 
Rahmen des bestebenden Formalismus ja zuerst von Dir klar erkannt 

St^e^X^T^T hat ‘ Ich & ber deshalb 

mcht ernsthaft damn glauben, weil die Theorie mit dem Grundsatz 

^mmbm wt dass die Physik eine Wirklichkeit in Zeit und Baum 

darstellen soli, ohne spukhafte Femwirkungen. . . . Davon bin ich fest 

uberzeugt, dass man schliesshch bei einer Theorie landen wird dernn 

^ M Wahla °heinHchkeiten, sondem 

SSS? 4 ? v Smd ’ man 68 bis VOT kurzem als selbstver- 

iS^S* ZW Be S r0adan 8 dieser Uberzeugung kann 

ZeuvfinW? 1 ° glSOll 1 e G f™ de ’ S0nd6m *»» meinen kleinenFhJer als 

.SdL^ I1 T a ’ ® Autoritat ’ ^ ausserhalb meiner Haut 

irgendwelchen Respekt einfldssen kann.’ 
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(I cannot substantiate my attitude to physics in such a manner that 
you. would find it in any way rational. I see of course that the statistical 
interpretation (the necessity of which in the frame of the existing for- 
malism has been first clearly recognized by yourself) has a considerable 
content of truth. Yet I cannot seriously believe it because the theory 
is inconsistent with the principle that physics has to represent a reality 
in space and time without phantom actions over distances. ... I am 
absolutely convinced that one will eventually arrive at a theory in whioh 
the objects connected by laws are not probabilities, but conceived facts, 
as one took for granted only a short time ago. However, I cannot provide 
logical arguments for my conviction, but can only call on my little finger 
as a witness, which cannot claim any authority to be respected outside 
my own skin.) 

I have quoted these letters because I think that the opinion 
of the greatest living physicist, who has done more than anybody 
else to establish modem ideas, must not be by-passed. Einstein 
does not share the opinion held by most of us that there is over- 
whelming evidence for quantum mechanics. Yet he concedes 
‘initial success’ and c a considerable degree of truth’ . He ob- 
viously agrees that we have at present nothing better, but he 
hopes that this will be achieved later, for he rejects the 'dice- 
playing god’. I have discussed the chances of a return to deter- 
minism and found them slight. I have tried to show that classical 
physics is involved in no less formidable conceptional difficulties 
and had eventually to incorporate chance in its system. We 
mortals have to play dice anyhow if we wish to deal with atomic 
systems. Einstein’s principle of the existence of an objective 
real world is therefore rather academic. On the other hand, his 
contention that quantum theory has given up this principle is 
not justified, if the conception of reality is properly understood. 
Of this I shall say more presently. 

Einstein’s letters teach us impressively the fact that even an 
exact science like physics is based on fundamental beliefs. The 
words ich glavbe appear repeatedly, and once they are under- 
lined. I shall not further discuss the difference between Ein- 
stein’s principles and those which I have tried to extract from 
the history of physics up to the present day. But I wish to 
collect some of the fundamental assumptions which cannot be 
further reduced but have to be accepted by an act of faith. 
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Causality is such a principle, if it is defined as the belief in the 
existence of mutual physical dependence of observable situa- 
tions. However, all specifications of this dependence in regard 
to space and time (contiguity, antecedence) and to the infinit e 
sharpness of observation (determinism) seem to me not funda- 
mental, but consequences of the actual empirical laws. 

Another metaphysical principle is incorporated in the notion 
of probability. It is the belief that the predictions of statistical 
calculations are more than an exercise of the brain, that they can 
be trusted in the real world. This holds just as well for ordinary 
probability as for the more refined mix ture of probability and 
mechanics formulated by quantum theory. 

The two metaphysical conceptions of causality and probability 
have been our main theme. Others, concerning logic, arithmetic, 
space, and time, are quite beyond the frame of these lectures. 
But let me add a few more which have occasionally occurred, 
though I am sure that my list will be quite incomplete. One is 
the belief in harmony in nature, which is something distinct from 
causality, as it can be circumscribed by words like beauty, 
elegance, simplicity applied to certain formulations of natural 
laws. This belief has played a considerable part in the develop- 
ment of theoretical physics — remember Maxwell’s equations of 
the electromagnetic field, or Einstein’s relativity— but how far 
it is a real guide in the search of the unknown or just the expres- 
sion of our satisfaction to have discovered a sig nifi c an t relation, 
I do not venture to say. Eor I have on occasion made the sad 
discovery that a theory which seemed to me very lovely neverthe- 
less did not work. And in regard to simplicity, opinions will 
differ in many cases. Is Einstein’s law of gravitation simpler 
than Newton’s? Trained mathematicians will answer Yes, 
meaning the logical simplicity of the foundations, while others 
vdh say emphatically No, because of the horrible complication 
of the formalism. However this may be, this kind of belief may 

help some specially gifted men in their research; for the validity 
of the result it has little importance (see Appendix, 35). 

The last belief I wish to discuss may be called the principle of 
objectivity. It provides a criterion to distinguish subjective 
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impressions from objective facts, namely by substituting for 
given sense-data others which can be checked by other indivi- 
duals. I have spoken about this method when I had to define 
temperature: the subjective feeling of hot and cold is replaced 
by the reading of a thermometer, which can be done by any 
person without a sensation of hot or cold. It is perhaps the most 
important rule of the code of natural science of which innumer- 
able examples can be given, and it is obviously closely related 
to the conception of scientific reality. For if reality is understood 
to mean the sum of observational invariants — and I cannot see 
any other reasonable interpretation of this word in physics — the 
elimination of sense qualities is a necessary step to discover 
them. 


Here I must refer to the previous Waynflete Lectures given by 
Professor E. D. Adrian, on The Physical Background of Percep- 
tion, because the results of physiological investigations seem to 
me in perfect agreement with my suggestion about the meaning 
of reality in physics. The messages which the brain receives have 
not the least similarity with the stimuli. They consist in pulses 
of given intensities and frequencies, characteristic for the trans- 
mitting nerve-fibre, which ends at a definite place of the cortex. 
All the brain ‘learns’ (I use here the objectionable language of 
the ‘disquieting figure of a little hobgoblin sitting up aloft in the 
cerebral hemisphere’) is a distribution or ‘map’ of pulses. From 
this information it produces the image of the world by a process 
which can metaphorically be called a consummate piece of com- 
binatorial mathematics: it sorts out of the maze of in diff erent 
and varying signals invariant shapes and relations which form 
the world of ordinary experience. 

This unconscious process breaks down for scientific ultra- 
experience, obtained by magnifying instruments. But then it is 
continued in the full light of consciousness, by mathematical 
reasoning. The result is the reality offered by theoretical physios. 

The principle of objectivity can, I think, be applied to every 
human experience, but is often quite out of place. For instance: 
what is a fugue by Bach ? Is it the invariant cross-section, or the 
common content of all printed or written copies, gramophone 
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records, soundwaves at performances, etc., of this piece of music ? 
As a lover of music I say No ! that is not what I mean by a fugue. 
It is something of another sphere where other notions apply, 
and the essence of it is not ‘notions 9 at all, but the immediate 
impact on my soul of its beauty and greatness. 

In cases like this, the idea of scientific objective reality is 
obviously inadequate, almost absurd. 

This is trivial, but I have to refer to it if I have to make good 
my promise to discuss the bearing of modem physical thought 
on philosophical problems, in particular on the problem of free 
will. Since ancient times philosophers have been worried how 
free will can be reconciled with causality, and after the tremen- 
dous success of Newton’s deterministic theory of nature, this 
problem seemed to be still more acute. Therefore, the advent of 
indeterministic quantum theory was welcomed as opening a 
possibility for the autonomy of the mind without a clash with 
the laws of nature. Free will is primarily a subjective pheno- 
menon, the interpretation of a sensation which we experience, 
similar to a sense impression. We can and do, of course, project 
it into the minds of our fellow beings just as we do in the case 
of music. We can also correlate it with other phenomena in order 
to transform it into an objective relation, as the moralists, 
sociologists, lawyers do — but then it resembles the original 
sensation no more than an intensity curve in a spectral diagram 
resembles a colour which I see. After this transformation into 
a sociological concept, free will is a symbolic expression to 
describe the fact that the actions and reactions of human beings 
are conditioned by their internal mental structure and depend on 
their whole and unaccountable history. Whether we believe 
theoretically in strict determinism or not, we can make no use 
of this theory since a human being is too complicated, and we 
have to be content with a working hypothesis like that of spon- 
taneity of decision and responsibility of action. If you feel that 
this clashes with determinism, you have now at your disposal 
the modem indeterministic philosophy of nature, you can assume 
a certain freedom , i.e. deviation from the deterministic laws, 
because these are only apparent and refer to averages. Yet if 
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you believe in perfect freedom you will get into difficulties again, 
beoause you cannot neglect the laws of statistics which are laws 
of nature. 

I think that the philosophical treatment of the problem of free 
will suffers often (see Appendix, 36) from an insufficient dis- 
tinction between the subjective and objective aspect. It is 
doubtless more difficult to keep these apart in the case of such 
sensations as free will, than in the case of colours, sounds, or 
temperatures. But the application of scientific conceptions to a 
subjective experience is an inadequate procedure in all such 
cases. 

You may call this an evasion of the problem, by means of 
dividing all experience into two categories, instead of trying to 
form one all-embracing picture of the world. This division is 
indeed what I suggest and consider to be unavoidable. If quan- 
tum theory has any philosophical importance at all, it lies in the 
fact that it demonstrates for a single, sharply defined science the 
necessity of dual aspects and complementary considerations. Niels 
Bohr has discussed this question with respect to many applica- 
tions in physiology, psychology, and philosophy in general. 
According to the rule of indeterminacy, you cannot measure 
simultaneously position and velocity of particles, but you have 
to make your choice. The situation is similar if you wish, for 
instance, to determine the physico-chemical processes in the 
brain connected with a mental process : it cannot be done because 
the latter would be decidedly disturbed by the physical investiga- 
tion. Complete knowledge of the physical situation is only 
obtainable by a dissection which would mean the death of the 
living organ or the whole creature, the destruction of the mental 
situation. This example may suffice; you can find more and 
subtler ones in Bohr s writings. They illustrate the limits of 
human understanding and direct the attention to the question 
of fixing the boundary line, as physics has done in a narrow 
field by discovering the quantum constant Ti. Much futile 
controversy could be avoided in this way. To show this by a 
final example, I wish to refer to these lectures themselves whioh 
deal only with one aspect of science, the theoretical one. There 
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is a powerful school of eminent scientists who consider such 
things to be a futile and snobbish sport, and the people who 
spend their time on it drones. Science has undoubtedly two 
aspects: it can be regarded from the social standpoint as a prac- 
tical collective endeavour for the improvement of h uman 
conditions, but it can also be regarded from the individualistic 
standpoint, as a pursuit of mental desires, the hunger for know- 
ledge and understanding, a sister of art, philosophy, and religion. 
Both aspects are justified, necessary, and complementary. The 
collective enterprise of practical science consists in the end of 
individuals and cannot thrive without their devotion. But 
devotion does not suffice; nothing great can be achieved without 
the elementary curiosity of the philosopher. A proper balance is 
needed. I have chosen the way which seemed to me to harmonize 
best with the spirit of this ancient place of lear ning . 



APPENDIX 

1. (II. p. 8.) Multiple causes 

Any event may have several causes. This possibility is not 
excluded by my definition (given explicitly on p. 9), though I 
speak there of A being ‘the’ cause of the effect B. Actually the 
‘number’ of causes, i.e. of conditions on which an effect B 
depends, seems to me a rather meaningless notion. One often 
finds the idea of a ‘causal chain’ A 1; A 2 ,..., where B depends 
directly on A lt A-^ on A z , etc., so that B depends indirectly on 
any of the A n . As the series may never end— where is a ‘first 
cause’ to be found ? — the number of causes may be, and will be 
in general, infinite. But there seems to be not the slightest 
reason to assume only one such chain, or even a number of 
chains; for the causes may be interlocked in a complicated way, 
and a ‘network’ of causes (even in a multi-dimensional space) 
seems to be a more appropriate picture. Yet why should it be 
enumerable at all ? The ‘set of all causes’ of an event seems to 
me a notion just as dangerous as the notions which lead to logical 
paradoxes of the type discovered by Russell. It is a metaphysical 
idea which has produced much futile controversy. Therefore I 
have tried to formulate my definition in such a way that this 
question can be completely avoided. 

2. (III. p. 13.) Derivation of Newton’s law from Kepler’s 
laws 

The fact that Newton’s law is a logical consequence of Kepler’s 
laws is the basis on which my whole conception of causality in 
physics rests. For it is, apart from Galileo’s simple demonstra- 
tion the first and foremost example of a timeless cause— effect 
relation derived from observations. In most text-books of 
mechanics the opposite way (deduction of Kepler’s laws from 
Newton’s) is followed. Therefore it may be useful to give the 
full proof in modem terms. 

We begin with formulating Kepler’s laws, splitting the first 
one in two parts: 6 

Id. The orbit of a planet is a plane curve. 

I b. It has the shape of an ellipse, one focus of which is the sun. 
8181 „ 
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n. The area A swept by the radius vector increases propor- 
tionally to time. 

TTT The ratio of the cube of the semi-axis a of the ellipse to 
the square of the period T is the same for all planets. 

From la it follows that it suffices to consider a plane, intro- 
ducing rec tangular coordinates x,y, polar coordinates r,tf>, so 

tk 0 * x = rcos<f>, y = rsm<f>. 

Indicating differentiation with respect to time by a dot, one 
obt ains for the velocity 

x = r cos <f> — rtf> sin <f>, y = r sin <f>-\-r<j> cos <f>, 

and for the acceleration 

x = a, cos <j>—a $ sin tf>, y = a r sin <£+a^ cos <f>, 

where a P = f—r<j > 2 , (1) 

a ^ = 2 r<j>+r$ (2) 

are the radial and tangential components of the acceleration. 

Next we use II. The element of the area in polar coordinates 
is obviously dA = ^ 2 ^. 



If the origin is taken at the centre of the snn, the rate of in- 
crease of A is constant, say \fi, dA = \hdt , or 

2 A = r 2 <f> = h . 

Now it is convenient to use the variable 

1 

u = - 
r 


( 3 ) 
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instead of r and to describe the orbit by expressing u as a function 
of <f>, u(<f>). Then , 


*-3-***. 

W 


( 5 ) 


(6) 

Substituting (4), (5), (6) into (2), one finds 



= 0 . 

Hence the acceleration has only a radial component a r ■with 
respect to the sun. To obtain the value of a r we calculate, with 
the help of (4), 

. __ dr ■ _ h dr _ , du 

ty 9 ~ 7 2 dj> ~ dj,’ 



and substitute this in (1): 

Now we use I b. The polar equation of an ellipse is 


r = ^ 


1-fecos^’ 


( 9 ) 


where q is the semi-latus rectum and e the numerical eccentricity; 


or 


u — -(1-J-ecos^). 


From this, one obtains 


du 




hence from (8) a. = — —u 2 = — — -1. 

q q r 2 


( 10 ) 


The acceleration is direoted to the sun (centripetal) and is 
inversely proportional to the square of the distance. 
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According to the third law HI one oan write 

a 8 _ ft 
T 8 “ 4t r 2 ’ 


( 11 ) 


where the constant fi is the same for all planets. 

Now integrating (3) for a full revolution one has 

2 A = hT. (12) 

On the other hand, the area of an ellipse is given by 

A = nab, (13) 

where a and 6 are the major and minor semi-axes. 

Taking in (9) ^ = 0 and <f> = it one gets the aphelion and 
perihelion distances; half of the sum of those is the semi-major 


axis: 


2\l+e + l- c / l_ e 2> 


while the semi-minor axis is given by 

hence aq = b 2 . 

Substituting this in (13), one gets from (12) 

solving with respect to h*/q and using (11): 

Aa a- 2 a * 

7 = 47r V* = /x - 

Therefore the law of acceleration (10) becomes 


(14) 


a r = 



(15) 


where n is the same for all planets, hence a property of the sun, 
called the gravitational mass. 

This demonstrates the statement of the text that Newton’s 
derivation of his law of force is purely deductive, based on the 
inductive work of Tycho Brahe and Kepler. The new feature due 
toNewton is the theoretical interpretation of the deduced formula 
for the acceleration, as representing the ‘cause’ of the motion. 
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or the force detemiirring the motion, which then led Him to the 
fundamental idea of general gravitation (each body attracts each 
other one). In the text-books this situation is not always clear; 
this may be due to Newton’s own representation in hia Principia 
where he uses only geometrical constructions in the classical 
style of the Greeks. Yet it is known that he possessed the 
methods of infinitesimal calculus (theory of fluxions) for many 
years. I do not know whether he actually discovered his results 
with the help of the calculus; it seems to me incredible that he 
should not. He was obviously keen to avoid new mathematical 
methods in order to comply with the taste of his contemporaries. 
But it is known also that he liked to conceal his real ideas by 
dressing them up. This tendency is found in Gauss and other 
great mathematicians as well and has survived to our tim e, 
much to the disadvantage of science. 


Newton regarded the calculation of terrestrial gravity from 
astronomical data as the crucial test of his theory, and he with- 
held publication for years as the available data about the radius 
of the earth were not satisfactory. The formula (3.3) of the text 
is simply obtained by regarding the earth as central body and 
the moon as ‘planet*. Then fi is the gravitational mass of the 
earth which can be obtained from (11) by inserting for a the 
mean distance R of the centre of the moon from that of the earth, 
and for T the length of the month. Substituting p = 4^r 2 R 2 /T 2 
into (15), where r is the radius of the earth, one obtains for the 
acceleration on the earth’s surface g ( = — a r ) the formula (3.3) 
of the text, _ i^RlRV 

7 ) * 


9 = 


(16) 


If here the values Rjr — 60, R = 3*84 x 10 10 cm., and 
T — 27 d 7 h 43 m 11 '6 s = 2*361 x 10® sec. are substituted, one 
finds g = 980 *2 cm. sec , -2 , while the observed value (extrapolated 
to the pole) is g = 980*6 cm. sec. -2 

This reasoning is based on the plausible assumption that the 
acceleration produced by a material sphere at a point outside is 
independent of the radial distribution of density and the wmi of 
the sphere can therefore be regarded as concentrated in the 
centre. The rigorous proof of this lemma forms an important 
part of Newton’s considerations and was presumably achieved 
with the help of his theory of fluxions. 
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3. (IV. p. 20.) Cauchy's mechanics of continuous media 
The mathematical tool for handling continuous substances is 
the following theorem of Gauss (also attributed to Green). 

If a vector field A is defined inside and on the surface $ of a 
volume F, one has 

J*divAdF = J A.ndS, (1) 

where n is the unit vector in the direction of the outer normal of 
the surface element dS and 


divA = 8 A*+ 8 Av + 8 A° = _i . A. 

dx dy dz 0X 


( 2 ) 


If p is the density, the total mass inside F is 

m = J p dV. (3) 

The amount of mass leaving the volume through the surface is 


u . n dS, 


where u = pv is the current, v the velocity. 

The indestructibility of mass is then expressed by 


m+ J u.n dS = 0. 


Substituting (3) and applying (1), one obtains a volume 

integral, which vanishes for any surface; hence its integrand 

must be zero: . , 

p+divu = 0. ( 4 ) 


This is the continuity equation (4.5) of the text. 

Consider now the forces acting on the volume V. Neglecting 
those forces which act on each volume element (like Newton’s 
gravitation), we assume with Cauchy that there are surface 
forces or tensions, acting on each element dS of the surface S, 
and proportional to dS . They will also depend on the orientation 
of dS, i.e. on the normal vector n, and can therefore be written 
T n dS. If n coincides with one of the three axes of coordinates 
x,y,z, the corresponding forces per unit area may be repre- 
sented by the vectors T x , T„, T s . Now the projections of an 
element dS on the coordinate planes are 

dS x = n x dS, dS v = n y dS, dS B = n e dS. 
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The equilibrium of the tetrahedron -with the sides dS, dS x , dS y , dS„ 
then leads to the equation 

T n dS = T.dBL+T y dS y +T,dS„ 
or T n = T ai » a ,+Ty» tf +T B »i a , (5) 

whioh is the formula (4.6) of the text. 



Consider further the equilibrium of a rectangular volume 
element, and in particular its cross-section z = 0, -with the sides 
dx, dy. The components of Tj, in this plane may be denoted by 
and T^, those of T„ by T yx , T vy . Then the tangential com- 



Fia. 5. 

ponents on the surfaces dydz and dxdz produce a couple about the 
origin 0 with the moment 

(Tcy dydz)dx — (T yx dxdz)dy. 

This must vanish in equilibrium; therefore one has 

m m 
- L xy • L yx9 

and the corresponding equations obtained by cyclic permutation 
of the indices, (4.7) of the text. Hence the stress tensor T defined 
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by (4-8) is symmetrical. One can express this, with the help of 
(5), in the form 

( T »)* = Txx n x+T vx n v+ T zx n z = T^.n, 

where T x is the vector (T^, T^, TJ. 

The ^-component of the total force F = f T n dS can now be 
transformed with the help of formula (1) into a volume integral 

= / (T ») x d8 = f T x .ndS = f divT a dF. 
s s v 

Using the tensor notation of the text, (4.10), one can write this 

F = J div T dV. (6) 

This has to be equated to the rate of change of momentum of a 
given amount of matter, i.e. enclosed in a volume moving in 
time. One has for any function O of space 

V«+A«) F(t) ' 

= S,5{/¥ A “ iF +/' t ' i 4 

av ' 

The second integral is extended over the volume between two 
infinitesimally near positions of the surface, so that 

dV = n.vAtdS 

and therefore 


Hence 


J ® dV =J ® n -ydS At =J div(<Dv) dV At. 

It J 0 dV = J |^+div(Ov)| dV. (7) 

77 ’ 


If this is applied to the < 

) 


JnnTdh! PP ? ed .? the corn P° nents of the momentum density 
pv one obtains for the rate of change of the total momentum P : 

— = £ f 
dt dt J 


-div(pw)J 


momentum P: 

dV 


~/ K5 + ( v, ^H +v {a* +div H] dV - 
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Here the second integral vanishes in consequence of the continu- 
ity equation (4), with u == pv. In the first integral appears the 
convective derivative, defined by (4.11) of the text, 


Hence 


dv &v , / d\ 
dt = ~at + \ ' dx) y ' 

dP C dv , T/ 

s- } fi w dr - 


( 8 ) 


Now the equation of motion 

d P 

dt 

reduces in virtue of (6) and (8) to (4.9) of the text: 


= F 


dv ,. _ 


(9) 


Consider in particular an elastic fluid whore 

I'xx — Tyy = = P, T ye — T m = Tgy = 0 , 

and the pressure p is a function of p alone. Then the continuity 
equation and the equations of motion 


^+di v(/>v) = 0, 


dt 


»{5+(-4)-)+|S=» 


are four differential equations for the four functions p 9 v x , v y , v z . 

If one wishes to determine small deviations from eq uili brium, 
then v and (f> = p — p Q are small and p Q constant with regard to 
space and time. Then the last two equations reduce in first 
approximation to , 

-fi+Po divv= 0, 


dv.fdp) 

Po dt '\dp) 0 dx 


By differentiating the first of these with respect to time and 

substituting p 0 — from the second, one finds 

dt 

l d P\ i:JW\ _ ^ 
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^1-44=^ 


(!).-■ 

(10) 

l 

c a dt 1 ~ 

(11) 


density 6 ?^ e tezt applied to the variation of 

equation wW? Vel °° lty components satisfies the same 
equation, which is the prototype of all laws of wave propagation. 

4 fiS P ' 24 ) MaxweU ’ 8 e^ations of the electromagnetic 

deS^^!^ 1 *?}***"*'• ™rk consisted in con- 

kw*’ mentioned in the text, in a set 
of differential equations which, with the usual notation, are 


divD = 4 t rp, 

divB==o, cutIE+Ib = o, 


ourlH = — u. 


( 1 ) 


® e ®> B = fiH. 

fieHiflb^dby pSg for theeleotrostatic 

remains . S ~ ’ H °» u = °> then there 

divD = 4tt P , ourlE = 0. 

The second equation imphes that there is a potential <f>, such that 

E = _££ 

8x‘ 

equation^' D = E ’ one obtains therefore Poisson’s 

-divE = A<f> = — 4vp. 

The solution is 


t-jUv, 


( 2 ) 


*>™da expresses 

issSssSSHiSli 
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Maxwell’s physical idea consisted in discovering the asym- 
metry in the equations (1) which, in our style of writing, is 

obvious even to the untrained eye: the missing term ifr in the 

second equation. The logical necessity of this term follows from 
the fact of the existence of open currents, e.g. discharges of con- 
densers through wires. In this case the charge on the condenser 
changes in time, hence p ^ 0; on the other hand, the equations 

(1) imply divu = — divcurlH = 0. Therefore the continuity 

equation p+divu = 0 is violated. 

To amend this Maxwell postulated a new type of ourrent 
bridging the gap between the conductors in the condenser, with 
a certain density w, so that 

curlH = — (u+w). (3) 

Then taking the div operation one has 

div w = —div u = p = — divD. 

vn 

The simplest way of satisfying this equation is putting 

w = ±b, (4) 

so that the corresponding equation in Maxwell’s set becomes 

curlH— -D = — u (5) 

and complete symmetry between electric and magnetic quanti- 
ties is obtained (apart from the fact that the latter have no true 
charge and current). 

The modified system of field equations permits the prediction 
of waves with finite velocity. In an isotropic substance free of 
charges and currents (p — 0, u = 0, D = <=E, B = pH.) one has 

curlH— -E = 0, curlE+-H = 0; 
c c 

taking the curl of one of them, and using the formula that for a 
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vector -with vanishing div one has curl curl = — A, one obtains 
for each component of E and H the wave equation 


a 

c| 8i 2 


VM’ 


For vacuum (e = p = 1) the velocity of propagation should 
therefore be equal to the electromagnetic constant c. As stated 
in the text, this constant has the dimensions of a velocity and 
be measured by determining the magnetic field of a current 
produced by a condenser discharge (measured therefore electro- 
statically). Such experiments had been performed by Kohl- 
rausch and Weber, and their result for c agreed with the velocity 
of light in vacuo. This evidence for the electromagnetic theory of 
light was strongly enhanced by experiments carried out by 
Boltzmann, which showed that the velocity of light in simple 
substances (rare gases, which are monatomic) can be calculated 
from their dielectric constant e (p being practically = 1 ) with the 
help of Maxwell’s formula Cj = c/Ve. 

Maxwell’s formulation satisfies contiguity, but its relation to 
Oauehy s form of the dynamical laws has still to be established, 
lhe electnc and magnetic field vectors, though originally defined 


do not exist), are defined by the equations also in places where 
neither charges nor currents exist. Yet they are not stresses 

d i ^ ^ogous strains, on which the stresses 

depend. The law of this connexion has also been found by Max- 

jSiJ*? 8 .? ma ^ em atical formulation of Faraday’s intuitive 
mterpretation of the mechanical reactions between electrified 

A^^ l b0 ^ eS - A 8hort ^^ion must here suffice. 

th ® electnc for °e on a point charge e, F = eE, 
u ^ A meCWalf0rCe ° n tte ele “ent of a linear current 
to h S ,/ a magneti ° field H ’ this &roe is perpendicular 

^uite^aue^r^ "T* tW ° re does *> It is not 

? determined, as one can obviously add any force 

expression is: ^ & Cl ° Sed oirCuit vanMles - The simplest 

F ~ ~ u A B, 

hr fH £ h 7 “f ? leihlg the ohan g e °f magnetic energy 
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To illustrate Maxwell’s procedure it suffices to consider oharge 
distributions in vacuo with density p and current u = pv. 

Combining the two forces F = eE and F = i u A B into one 

expression, one has for the density of force 

f = p{E+i(v A H)}, (7) 

the so-called Lorentz force. 

Substituting here for p and pv the expressions from Maxwell’s 
equations one can, by elementary transformations, bring f into 
the form of Cauchy, ^ ^ 

where 1 

T xx = 

T yz — — (E y E z +H v H g ), ... . 

These are the celebrated formulae of Maxwell’s tensions. They 
can be easily generalized for material bodies with dielectric 
constant and permeability, and they have become the prototype 
for similar expressions in other field theories, e.g. gravitation 
(Einstein), electronic field (Dirac), meson field (Yukawa). 

5. (IV. p. 27.) Relativity 

It is impossible to give a short sketch of the theory of relativity, 
and the reader is referred to the text-books. The best representa- 
tion seems to me still the article in vol. v of the Mathematical 
Encyclopaedia written by W. Pauli when he was a student, 
about twenty years of age. There one finds a clear statement of 
the experimental facts which led to the mathematical theory 
almost unambiguously. Eddington’s treatment gives the im- 
pression that the results could have been obtained — or even 
have been obtained — by pure reason, using epistemological 
principles. I need not say that this is wrong and misleading. 
There was, of course,' a philosophical urge behind Einstein’s 
relentless effort; in particular the violation of contiguity in 
Newton’s theory seemed to him unacceptable. Yet the greatness 
of his achievement was just that he based his own theory not on 
preconceived notions but on hard facts, facts which were obvious 
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to everybody, but noticed by nobody. The main fact was the 
identity of inertial and gravitational mass, which he expressed 
as the principle of equivalence between acceleration and gravita- 
tion. An observer in a closed box cannot decide by any experi- 
ment whether an observed acceleration of a body in the box is 
due to gravity produced by external bodies or to an acceleration 
of the box in the opposite direction. This principle means that 
arbitrary, non-linear transformations of time must be a dmi tted. 
But the formal symmetry between space- coordinates and time 
discovered by Minkowski made it very improbable that the 
transformations of space should be linear, and this was corro- 
borated by considering rotating bodies : a volume element on the 
periphery should undergo a peripheral contraction according 
to the results of special relativity, but remain unchanged in the 
radial direction. Hence acceleration was necessarily connected 
with deformation. This led to the postulate that all laws of 
nature ought to be unchanged (covariant) with respect to 
arbitrary space-time transformations. But as special relativity 
must be preserved in small domains, the postulate of invariance 
o± the line element had to be made. 

The long struggle of Einstein to find the general co variant field 
equations was due to the difficulty for a physicist to a ssimila te 
tne mathematical ideas necessary, ideas which were in fact 
completely worked out by Riemann and his successors, Levi- 
Uivita, Ricci, and others. 


n ffl!™? t0 t ddl “ re 01117 oneMm »k- The physical significance 
ot the hne element seems to me lather mystical in a genuinely 
OMtamom space-time Hit is rep!aoed by the assumption of 
parallel displacement (affine connexion), this impression of 
mystery rs still farther enhanoed. On 'ihe othe/ZTthe 

“ «« first step intbis<ttrec- 

Pta2^ into tte laws of 

piiysos. There are numerous indications that the further 

S tC ^ t0 a 8e * arate a Ppe®fance of 
of present-dlt nW r “ the ultimate law s. The difficulties 

Centre ? a J out problem of intro- 

relativitv uw £ m way wlu ch satisfies the principle of 
relatmiy. Tins faot seem, to indicate that rdatfaity it«S 
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needs a generalization where the infinitesimal element ds is 
replaced by a finite length. 

The papers quoted in the text are: A. Einstein, L. Infeld, and 
B. Hoffmann, Ann . of Math ., 39, no. 1, p. 65 (Princeton, 
1938); V. A. Pock, Jowm.of Phys. U.S.S.R. l,no. 2, p. 81 (1939). 

6. (V. p. 38.) On classical and modern thermodynamics 

It is often said that the classical derivation of the second law 
of thermodynamics is much simpler than CarathSodory’s as it 
needs less abstract conceptions than Pfaffian equations. But 
this objection is quite wrong. For what one has to show is the 
existence of an integrating denominator of dQ. This is trivial 
for a Pfaffian of two variables (representing, for example, a single 
fluid with F,#); it must be shown not to be trivial and even, in 
general, wrong for Pfafifians with more than two variables (e.g. two 
fluids in thermal contact with V J, l^,#). Otherwise, the student 
cannot possibly understand what the fuss is all about. But that 
means explaining to him the difference between the two classes 
of Pfafifians of three variables, the integrable ones and the non- 
integrable ones. Without that all talk about Carnot cycles is just 
empty verbiage. But as soon as one has this difference, why not 
then use the simple criterion of accessibility from neighbouring 
points, instead of invoking quite new ideas borrowed from 
engineering ? I think a satisfactory lecture or text-book should 
bring this classical reasoning as a corollary of historical interest, 
as I have suggested long ago in a series of papers ( Phys . Zeitschr. 
22, pp. 218, 249, 282 (1921)). 

Since writing the text I have come across one book which 
gives a short account of Carath6o dory’s theory, H. Margenau 
and G. M. Murphy, The Mathematics of Physics and Chemistry 
(D. van Nostrand Co., New York, 1943), § 1.15, p. 26. But 
though the mathematics is correct, it does not do justice to the 
idea. For it says on p. 28: ‘This formal mathematical con- 
sequence of the properties of the Pfaff equation [namely the 
theorem proved in the next section of the appendix] is known as 
the principle of Carath6odory. It is exactly what we need for 
thermodynamics.’ Carath6odory’s principle is, of course, not 
that formal mathematical theorem but the induction from obser- 
vation that there are inaccessible states in any neighbourhood, 
of a given state. 
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7. (V. p. 39.) Theorem of accessibility 
An example of a Pfaffian which has no integrating denomina- 
tor (by the way, the same example as described in geometrical 
terms in the text) is this: 


dQ = — y dx- fa; dy+k dz, 

where k is a constant. If it were possible to write dQ in the 
form Xd<f>, where A and <f> are functions of x, y, z, one would have 


hence 


dd> y 
8x = ~\’ 


dtf> x 

&y~ A’ 


H_k 
8z ~ A’ 


dydz 


d_(x 
' 8z\ A, 


8_(k 

8y l A 


8z\x) dx\X’ 


dzdx 


_±(y\ _ 8[x\ 

Q .,1 \ I o.. I \ I > 


or 


tefy 8y\X) dx\X 


8\ 

*5" V 


-y— = k — 
V dz “dx’ 


= a; - — \-y — . 

dx^^ dy 


*/ 

By substituting dX/dx and dX/dy from the first two equations 
m the third one finds . 

A = 0. 

. Exa “pfes 1 like this show clearly that the existence of an 
integrating denominator is an exception. 

P r °of of the theorem of accessibility. 
Consider the solutions of the Pfaffian y 


dQ — X dx-\-Y dy-\-Z dz — 0, 
which lie in a given surface S, 

* = x{u,v), y = y(u,v), z = z(u,v). 
They satisfy a Pfaffian 


( 1 ) 


dQ= Udu+Vdv = 0, 



( 2 ) 


where 
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Hence through every point P of S there passes one curve, because 
(2) is equivalent to the ordinary differential equation 

du_ V 

Tv-~U’ 

which has a one-parameter set of solutions <f>(u, v) = const., 
covering the surface S. 

Letusnowsuppose that, in the neighbourhood of apointP, there 
are inaccessible points ; let Q be one of these. Construct thr o ugh 



Fig. 6. Fig. 7. 


P a straight line -Sf, which is not a solution of ( 1 ), and the plane ir 
through Q and SP. In rr there is just one curve satisfying (1) and 
going through Q; this curve will meet the line :S?at a point 12. 
Then B must be inaccessible from P; for if there should exist a 
solution leading from P to B, then one could also reach Q from 
P by a continuous (though kinked) solution curve, which 
contradicts the assumption that Q is inaccessible from P. The 
point B can be made to lie as near to P as one wishes by choosing 
Q near enough to P. 

Now we move the straight line ^parallel to itself in a cyclic 
way so that it describes a closed oylinder. Then there exists on 
this cylinder a solution curve # which starts from P on -S? and 
meets S£ again at a point N. It follows that N and P must coin- 
cide. For otherwise one could, by deforming the cylinder, make 
N sweep along the line JS? towards P and beyond P. Hence there 
would be an interval of accessible points (like N) around P, 

5131 T 
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while it has been proved before that there are inaccessible points 
v in any neighbourhood of P. 

_ \ n °Z c ^ mc]d ® s P the connecting curve # can be 
made, by steady deformation of the cylinder, to desoribe a 
surfoce which contains obviously all solutions starting from P. 
If this surface is given by y, z) = 0, one has 

dQ = Xd<j>, 

which is the theorem to be proved. 

The function j and the factor A are not uniquely determined: 

if ^ is replaced bygone has 

A d<f>= Ad® with A = A—. 

d<f>' 

8. (V. p. 43.) Thermodynamics of chemical equilibria 

t>,^ a ? 40dOry ’ S , P ubIica tion on his foundation of 

thermodynamics {Math. Ann. 61, p. 365, 1909) is written in a 

He considers a type of systems which are 

neclZ P to fT 6 f n . ed *7 tlie P ro P erfc y that of the parameters 
necessary to fk a state of equilibrium all except one are con- 
figurational variables, i.e. such that their values can be axbi- 

theorv^oflfi^i t my own presentation of the 

end S ) bni 1 T 0 ^ ^ Appendix - 6 )> there 18 only a hint at the 
end (§ 9) how such variables can be introduced in more complica- 

can be chmsed - 1 hoped at 

needs nothW 6 by the chemists themselves, for it 

Sb iShSST 2 th6 ^ meth0d of ^mi-permeable 
SnSeU 1 2 m0d f cation of wording. As tlds haa not 

® Ye . he f e a sh °rt indication how to do it. 

arraTlfS 2 & (™thout decomposition), but 

mdepfndeiSvrf & V ° lmne F ^ mass M are both 

rcvSt wi, ?! ab e - F ° r thh pmpose one ba ® to imagine 
bv a valvA +>, a p “t°n attached to the volume V, connected 

volume^? Stance can be pressed into the 
j . . considered. The position of the auxiliary niston 

sssr rv* * .°°“ tai ‘ e4 “ rs 

If the valve is closerf^^ 0 ^ Vadable “ Carath^odory’s sense. 

^ ^4 r 
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dent configuration variables, and the work done for any change 
of them must be regarded as measurable. If this work is deter- 
mined adiabatically one obtains the energy function, say in 
terms of F, M and the empirical temperature Z7(F, M,d). 



When this is known the differential of heat is defined by the 
difference dQ = dU+pdV-pdM, (1) 


where p and ju. are functions of the state (F, M,&) like U, which 
can be regarded as empirically known. 

Now one has in (1) a Pfaffian of three variables and can apply 
the same considerations as before which lead to the result that <2$ 
is integrable and can be equated to T dS. Hence one can write 


dU = TdS—pdV+pdM. (2) 

But U must be a homogeneous function of the first order in the 
variables S, F, M. If one introduces the specific variables 

by U = Mu, S = Ms, V — Mv, (3) 

one has according to Euler’s theorem 


u = Ts—pv+p, 


where 


T — — 

8s’ 


8u 


(4) 

(5) 


and then, from (2), du = Tds—pdv. (6) 

If the substance inside V is a chemical compound and one wishes 
to investigate its decomposition into n components, one assumes 
n cylinders with pistons attached to F, separated from F by 
semi-permeable walls, each of which allows the passage of only 
one of the components. Then one has in the same way 


dU = TdS-pdV+^pidMt, 


(7) 
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where V, M 1; can be regarded as configuration variables 

and U, p, Pi,p z ,...,p n aa known functions of these. Now, as above, 
the specific energy, entropy, and volume are introduced and 
further the concentrations c t by 


M { = Cl M, 

(8) 

where M is the total mass: M = J Jf 1( hence 


n 

= *■ 

(9) 

One obtains from Euler’s theorem 


u = Ts-pv+ | p t c it 

(10) 

with T = — v — 8u „ 8u 

8s’ P dv’ N dc { ’ 

(11) 


where the differentiation with respect to the is performed as 
if they were independent; and 


du = Tds-pdv+jr^dCf. ( 12 ) 

. Tlle formalism of thermodynamics consists in deriving rela- 
tions between the variables by differentiating the equations 
( n )» e.g. 8T 

0-1 __ dp dp t dp 

a*’ 8v ~ a^’ etc - 

As experiments are often performed at constant pressure or 
temperature or both, one uses instead of <0, the 

functions free energy u-Ts =/, enthalpy u+pv = or free 

en ft n Ts (defined by (4)). For the latter, for 

example, it follows from (12) that 

dp, = sdT+vdp+ ^ p i dc t , (13) 

or s = — V = 8 P _ 8 P 

dT’ dp’ < 14 > 

so that for T = const., p = const., one has simply 

dp = Pi dc { . (15) 

The most important theorem, which follows from these general 
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equations, is Gibbs’s phase rule. The system may exist in differ- 
ent phases if the n equations 

Pt(T,p, c v ...,c n ) = C t (i= 1,'..., n) (16) 

have several solutions. Let m be the number of independent 
solutions; then there are m phases which can be ordered in such a 
way that each has contact with two others only. Hence there 
are m—1 interfaces and n equations of the form (16) for each, 
altogether n(m — 1) independent equations. On the other hand, 

there are (n— 1) independent concentrations c x c n _ x for each 

phase, i.e. m(n— 1) for the whole system, to which the two vari- 
ables p, T have to be added; hence m(n — 1)+2 independent 
variables. The number of arbitrary parameters, or the number of 
degrees of freedom of the system, is therefore 

m(n— 1)+2— n(m— 1) = n—m-j-2. 

So for a single pure substance, n = 1, the number of degrees of 
freedom is 3— m; hence there are three cases m — 1 , 2 , 3 corre- 
sponding to one phase, two or three coexisting phases; more than 
three phases cannot be in equilibrium. All further progress in 
thermodynamics is based on special assumptions about the func- 
tions involved, either prompted by experiment, or chosen by an 
argument of simplicity, or — and this is the most important 
step — derived from statistical considerations. 

9. (V. p. 44.) Velocity of sound in gases 

The simple problem of calculating the adiabatic law for an 
ideal gas gives me the opportunity to show how the theory of 
Carath4odory determines uniquely the absolute temperature 
and entropy. 

The ideal gas is defined by two properties: (1) Boyle’s law, 
the isotherms are given by pV = const.; (2) the same quantity 
pV remains constant if the gas expands without doing work. In 
mathematical symbols, 

pV = F(&), U = U{ &). 

Hence dQ = dU+p dV = U'dd+Fy-. (1) 

If 6 is defined by 

log 8 = 


C U'(d)d& 

J m ’ 


(2) 
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(1) can be written 

dQ = F(&) dlog(9V); 

hence one can put A = F(&), <j> = log (6V) and obtain frona the 
equation (5,25) of the text 


wjm _ a ( lo 8 A ) _ Slog F(&) 
d& d& 


Then (5.27) gives, writing C = 1/B, the usual fonn of the 
equation of state ^ _ m _ ^ (s) 


aad S = 8 o +Blog(0F). 

If the special assumption is made, that U depends linearly on 
pV (which holds for dilute gases with the same approximation 
as Boyle’s law), one has U = c v T and, from (2), 



The entropy becomes therefore 


S = S 0 +log(T°*V B ), (4) 

or, substituting p for T from (3), 


where 


8 = tfj+log^ F«»), 

c p = C ®+-B 


is the specific heat for constant pressure, 
law S = const, is equivalent to 


( 5 ) 

( 6 ) 

Hence the adiabatic 


pV 7 = const., 


c_ 

Y = ~, 


(7) 


^ entical ^ the equation p = a P v in the text, as the 
density p is reciprocal to the volume V. 

to^lO) 1 ^ 0 ^ 0 ™' 1 ^ caloulated “ Appendix, 3; according 

c= I*. 

V dp 

F r the isothermal law, p = ap , this means 
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while, for the adiabatic law, p = apY, one finds 



which is considerably larger; e.g. for diatomic molecules (air) 
experiment, and kinetio theory as well, give y = g = 1*4. 

10. (V. p. 45.) Thermodynamics of irreversible processes 

Since I wrote this section of the text a new development of 
the descriptive or phenomenological theory has come to my 
knowledge which is remarkable enough to be mentioned. 

It started in 1931 with a paper by Onsager in which the 
attempt was made to build up a thermodynamics of irreversible 
processes by taking from the kinetic theory one single result, 
called the theorem of microscopic reversibility, and to show that 
this suffices to obtain some important properties of the flow of 
heat, matter, and electricity. The starting-point is Einstein’s 
theory of fluctuations (see Appendix, 20), where the relation 
S = fclogP between probability P and entropy S is reversed, 
using the known dependence of S on observable quantities to 
determine the probability P of small deviations from equili- 
brium. Then it is assumed that the law for the decay of an 
accidental accumulation of some quantity (mass, energy, tem- 
perature, etc.) is the same as that for the flow of the same 
quantity under artificially produced macroscopic conditions. 
This, together with the reversibility theorem mentioned, 
determines the main features of the flow. The theory has been 
essentially improved by Casimir and others, amongst whom 
the book of Prigogine, from de Donders’s school of thermo- 
dynamics in Brussels, must be mentioned. Here is a list of the 
literature: 

L. Onsager, Phys. Rev. 37, p. 406 (1931) ; 38, p. 2265 (1931). 

H. B. G. Casimir, Philips Research Reports, 1, 185-96 (April 1946); 

Rev. Mod. Physics, 17, p. 343 (1946). 

C. Eckhart, Phys. Rev. 58, pp. 267, 269, 919, 924 (1940). 

J. Meixner, Arm. d. Phys. (v), 39, p. 333 (1941); 41, p. 409 (1943); 

43, p. 244 (1943) ; Z. phys. Chem. B, 53, p. 236 (1943). 

S. R. de Groot, L'Effet Soret, thesis, Amsterdam (1946) ; Journal 

de Physique, no. 6, p. 191. 

I. Prigogine, Etude thermodynamique des pMnomknes vrriversibles 

(Pads, Dunod; Li4ge, Desoer, 1947). 
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11. (VI. p. 47.) Elementary kinetic theory of gases 

To derive equation (0.1) of the text, consider the molecules 
of a gas to be elastic balls which at impact on the wall of the 
— vessel recoil without loss of energy and 
momentum. If the yz - plane coincides 
with the wall the ^-component of the 
momentum m £ of a molecule is changed 
into — m£; hence the momentum 2m£ is 
transferred to the wall. Let Tiy be the 
number of molecules per unit of volume 
having the velocity vector v. If one 
constructs a cylinder upon a piece of the 
wall of area unity and side v dt } all mole- 
. ,, ... cu ^ es ^ strike the part of the wall 

withm the cylinder in the time-element dt; the volume of the 

C *^ , f t hence the number of collisions per uni t surface 

tS! ft* T d the total mo mentiim transferred 2m^X- 

overTw* 1 ? to . summed over ^ angles of incidence (i.e. 

of the ir pher ,t ’ ! he ? 81llt 18 obvio nsly the same as one-half 
of the sum over the total sphere, namely 

2m 2i & ri y = 2mi^n v , 

vdodWof^r^ 1 ?f mol6CuIes P er of volume, with a 
of V (but any Motion). Now the 'principle 

of molecular chaos’ is used according to which 

? = ? = T 2 = ilF+^+F) = 

Hmico the lest expression fa equal md th6 fa 

ftmfly obtained by Bumming over all velocities 

y-fSoX-jnS*. 

The total (kmetio) energy in the volume y fa 

hooce one obtains the e, nation ( 61 ) of the tent, 

Vp = 1 17 . 


( 1 ) 
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gas at a fixed empirical temperature & satisfy pV = const.). 
Then one obtains 

pV — RT, U = $RT, (2) 

as stated in (6.2) of the text. 


12. (VI. p. 60.) Statistical equilibrium 
If E depends only on p, not on x, the equation \H, /] = 0 

V-o, 


reduces to 


P- 


3x 


( 1 ) 


and is equivalent to the set of ordinary differential equations 

dx _dy _dz 

Px Py Pz { 

By integrating these (p is constant) one obtains the general 
solution of (1) as an arbitrary function of the integrals of (2), 

/_•( p.m). (3) 

where m = XAp. (4) 

Now if the gas is isotropic, / can depend only on p 2 and m 2 , 
and if it is to be homogeneous (i.e. all properties are independent 
of x), m 2 cannot appear; hence 

/ = 0(p 2 ) = \(H), (6) 

as stated in the text. 


13. (VI. p. 61.) Maxwell’s functional equation 

To solve the equation (6.10) it suffices to take = 0; putting 
= y, one has 


f(x+y) = <f>(x)<j)(y). 

(1) 

Differentiating partially with respect to x, 


f'(x+y) = <f>'(x)<f>(y), 

(2) 

and dividing by the original equation 


f'{x+y)_ </>’(x) 
f(x+y) <f>(x) ' 

(3) 
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r- 

/»/ / V 


/(y) ~ “* 


(4) 


where 0 is a constant. By integration, 

f(y) = ae-fo = e oc- js» (5) 

whioh is the formula (6.11) 0 f the text. 

14 tion I P 62,) The method of the most probable distribu- 

^ Probability of a distribution of 

»,of ,La to tC™?S 0 w f; , i , *‘ f “r *“ “ tte tat 

we first take the narticles in o' Wa j" " ‘ ~^~ njv = n )‘ To do this 
distribution (n, n 9 n ) is ^ ed _? rder ; tllen the probability of 
^ 2, -’ %)l8) ac cordmg to the multiplication law, 

<^n-c» n = a>5V£*...G$r, 

*h -- ' 


^2 


wherA ,, N 

malized so that v ° lumes of the cells, nor- 

asked for, we have to des SJlfi To obtain the probability 
one performs first all « f ? + ?* ed order of the Panicles. If 

as all those distributions, wlfi^h S/o^ 6 ? t0 ° CaB6S ’ 

Particles in each cell count rmiJ™ m? by permutin g the 
divide nl by the number of all Tberefore one has to 

that is by 

%) (1) 

ST “ ^ »• •— I term fa th. polynonrial eip , n _ 

'£» P(ni ’ n * ’*>-2 sAn-r-st 


»»•••»* 


,«l ! — %!* 


W ■. ~ ( a, l+ w 2+ — -)-£Oj V ) re = 1. 

formato^^^/S^^^alby Stirling's 
°8 »0 Ioga.2.3...») = logl-Hog 2 +log 3 +... + log,, 
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and replace the sum j log h by the integral 

n 

j log® dx = n(log»— 1). 
o 

A more satisfactory derivation is the following : One can represent 
n\ by an integral and evaluate it with the help of the so-called 
method of steepest descent, which plays a great part in the 
modem treatment of statistical mechanics due to Darwin and 
Fowler (see p. 54). The approximate evaluation of n\ may serve 
as a simple example of this method. 

If the identity 

A- (e~ x x n ) = —er x x nJ r ne- x x n - 1 
dx 


is integrated from 0 to oo and the abbreviation (r-function) 


I>+1) = J e~ x x n dx (n> — 1), 


used, one obtains r(n-)-l) = nT(n). 


(2) 

(3) 


As r(l) = 1 , one has 

T(2) = 1 . r(l) = 1, T(3) = 2T(2) = 1.2, 

r(4) = 3r(3)= 1.2.3, 

and in general r(»+l) = n\. (4) 

The integral (2) can be written 


oo 

r(»+l) = J eJ^dx, f(n, x) = — a;+»loga;. 


( 6 ) 


The funotion f(n, x) (hence also the integrand) has a maximum 
where „ 

/'(*) = - i+5 = o, 

JO 


i.e. at x = n, and 

f(n) = — n+n\ogn, 


1 
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fl of the roa™ 

/(*) = -n+nlog^- -L (x-n)*+..., 

and one has 2n 

T(n+1) = e-th+nic* „ j c ^_ n) . /are(1+ ^ ^ 

order which can be easily 
-u these are neglected the integral becomes 

for large w. Hence 

and W,TT +1)==V(2 ^ + - (6) 

where the h^h^wr^^^^+r 1 (7) 

Thus the logarithm of th^rXbffity^ITbe^tten 

logP = 2 <f> s in a ) -j- const., (g\ 

(8) and (9) are for 'LL’ (9) 

the text. ’ q l “ s ’ e^alent to formula (6.17) of 

ditioM (“i?, OlmTt£ e te^rSSy 0f 108 P th6 °° n - 

.?,*• = ”, !».«. = cr. (10) 

Without using the special fx™ j 

s me special lorm of <f> a! one obtains 

^ ==A+ ^* (11) 

(9) one hir W ° Lagrangian faotors - For the special function 

fU 


H, . 

= log^-log^-i, 

and if this is substituted in (11) with A+l = _ a> 

log» s = logco s + a _^ sj 

«« is. fe, eqeal *,,, t£ d^ 18) ^ ^ 


( 12 ) 


(13) 
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If on© has two sets of systems A and B, as discussed in the text, 
there are three conditions 


2VU) 


nm 


NW 




2 = n^\ 2 < B) = n®, 2 2 < B H B) = U 9 

,Ba l r=l r-1 r=»l 

and therefore instead of two multipliers three, A W) , A®, /} ; and 
one obtains, with A^-f- 1 = — A®-)- 1 = — a®, the formulae 
(6.19) of the text, which show that j8 is the equilibri um para- 
meter, a function of the (empirical) temperature & alone. 

In order to see that j8 is reciprocal to the absolute tempera- 
ture one must apply the second theorem of thermodynamics, 
which refers to quasi-static processes involving external work 
(for instance by changing the volume). 

By an infinitely slow change of external parameters a x ,a „..., 
the energies of the cells e r will be altered and at the same time 
the occupation numbers n r ; the total energy will be changed by 

dU = 2 Ur 2 ? €rdnr ’ ( 14 ) 
T O G 

while the total number of particles is unchanged, 
dn = 2 dn r — 0. 

r 

The first term in (14) represents the total work done 
dW = —7i ]?f a da a , 


where 


» 1 ■v' 8e 

f° = -nZ n 'W, 


(15) 

(16) 
(17) 


is the average force resisting a change of a„. Then the second 


term in (14) 


= J,e r dn r 


(18) 


must represent the heat produced by the rearrangements of the 
systems over the cells. 

The corresponding change of logP is obtained from (8) 
and (11), 


dlogP = ^ = 2 )dn r , 

which in virtue of (15) and (18) reduces to 
dlogP = pdQ. 


( 19 ) 
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This shows that )S dQ is a total differential of a function depend- 
ing on /?, a 2 ,..., and that is the integrating factor. 

Hence the second law of thermodynamics is automatically 
satisfied by the statistical assembly, and one has, with the 
notations of section V, 

dQ = \d4, with A = <f> — logP; 

P 

then (5.25) and (5.26) give, with C = Ilk, logO = 0, <J> = 1, 
and (5.27) 2 

T = S-8 0 = hf> = klogP. (20) 

i is called Boltzmann’s constant. 

How the change of energy (14) becomes 

dU= -n'Zf a <to 0 +TdS. (21) 

<r 

If one has a fluid with the only parameter a , = V, the corre- 
sponding force is the pressure 




( 22 ) 


and one obtains the usual equation 

dU = —pdV+TdS. (23) 

one amp!™ 0 ^ enera f ex pression (21) one sees easily that 
function S ‘sum-ov^S’f termS ° f ^ S °' CaUed P artition 

Z = X w r e ~^ £r - (24) 

Por, from (10) and (13), r 

n == e“ ^ c i) r e~fer — e a^ 


hence 


^ — e “ 2 € r <O r e~P e r = gtt ^ 

r 00’ 
a = log ft— log 2. 


\ow — Aug 

(8), (9), (17)^’ ^ Slmple caIcula tion, from (19), (20) with 



( 25 ) 
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and du= — '2 f f <T da <r +Tds. (26) 

a 

The simplest thermodynamical function, from whioh all 
.others can be derived by differentiation, is the free energy, 

f = u—Ts = —JcTlogZ, i 
df= — ^f,da,—sdT\ j ( 27 ) 

hence f, = — d -L = —hT dlogZ , 

Jc da, 8a, ’ 

while 8 = —dfjdT leads back to the second formula (25). 

The application to ideal gases may be illustrated by the simp- 
lest model where each particle is regarded as a mass point with 
coordinates x,y,z, momenta p x ,p y ,p s , and mass m. Then, 
according to Liouville’s theorem, one has to take as cells cu a 
elements of the phase space dxdydzdp x dp y dp z and replace the 
sums by integrals. The energy is (pl+pl+p%)/2m. Then the par- 
tition function (24) becomes 


The integration over the space coordinates gives V, the volume. 
If one puts A /(jS/2m)p a! = £,..., one has 

Z = Vi^jj JJJ er<P+'o*+l*) dgdrjdC, 

the integration extended for each variable from — oo to + 00 . 
The integral is a constant which is of no interest as all physical 
quantities depend on derivatives of Z. Henoe, with = ( kT)- 1 , 

f= —kTlogZ = — kTlogV— pTlogAT+const., 
from which one obtains 

p = - n § = ^> S== ~w = ^ogV+mogkT+1), 

u = f+Ts = §kT-\- const. 

These are the well-known formulae for an ideal monatomic gas: 
Boyle’s law, the entropy and energy per atom. The specific 
heat at constant volume is 

c v = < ^ = % nk = 


if n refers to one mole. 
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15. (VI. p. 54.) The method of mean values 
The method of Darwin and Fowler aims at computing the 
mean value of any quantity f(n r ), depending on the occupation 
number n r of a cell for all possible distributions n^, n 2 ,...,n N , 
satisfying the conditions 


J,n r = n, 2 e r n r = U > (!) 

r r 

that is, the quantity 

7K)= 2 P(%, %,..., n A ,)f(n r ), (2) 

where P is the probability of the distribution n v n 2 ,..., n N , defined 
by equation (6.15) or 14,(1). 

We consider the function F(z) defined by (6.22), 

F{z) = (3) 

and assume that a very small unit of energy is chosen so that all 
the e r are positive integers, which may be ordered in such a way 

t at 6j ^ € 2 ^ e 8 ^ ... ^ e N ' } also, by choosing the zero of energy 
suitably we can arrange that = 0. 

Then we expand {F(z)}» into powers of z according to the 
multmomial theorem and obtain a series of terms 


by collecting all these terms with the same factor we obtain all 

»*■•■.%) which belong to the same value of 
U = 

mvltoSftn-? 1 each /K) “ (2) and obtain in this 

di8 “ n8 which * 

~ coefficient of z u in {F(z)} n . 


(4) 

is taken round a closed contour surrounding 
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the origin 0 in the 2 -plane. The integral can be evaluated 
approximately by the method of steepest descent which we have 
already explained, for real variables, in Appendix 14, for the 
T-function. 

The first step is to express the integrand in the fa™ 
z^+^>{F{z)Y = G( 2 ) == »logJ , (z)-(C/+l)logz. 


Since 


F'(Z) _ J 6 r**' 
F(z) 2 **' 

r 


and 







2 e »* e * V 

</(?**)]’ 


both log F(z) and its derivative increase monotonically from a 
finite value to oo as z moves along the real axis between 0 and 
oo. Also — log z and its negative derivative 2 r 1 decrease mono- 
tonically along the same path. Hence 0(z) can have only one 
extremum, a minimum, on the real axis between 0 and 00 , and 
this minimum will be extremely steep if n and U are large. 

Also let z 0 be the point of the real axis where the minimum 
happens to be; then at this point the first derivative of G(z) 
vanishes and the second is positive and very large. Hence in the 
direction orthogonal to the axis the integrand must have a very 
sharp maximum. If we take as contour of integration a circle 
about 0 through z 0 , only the immediate neighbourhood of this 
point will contribute appreciably to the integral. 

The minimum z 0 is to be found as root of the equation 


G'(z) = 




(5) 


and one has 



This shows that for large U and n a proportional increase in U 
and n will not change the root z 0 , while G"(z 0 ), which is positive, 
can be made arbitrarily large. 
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Putting z = z 0 +iy one obtains for the integral (4) 

00 

^ p = JL J e-4 0 w; <j!y, 

— oo 

where the terms of higher than seoond order are omitted and the 
limits of integration are taken to be ±oo because of the sharp 
drop of the exponential function. This gives 

<6) 

!7+ 1 can be replaced by U, because of the smallness of the energy 
unit chosen; if one puts 

*o = er* (7) 

one has, for N -+ oo, 

m = I o> r %' = I <D r e-fe' = Z(P), (8) 

r r 

which shows that the function F(z) is equivalent to the partition 
function introduced in (14.24), p. 158. 

If one now takes the logarithm of (6) the leading terms are 
log^ P = pU +n log Z. 

On the other hand, one has from (5) to the same approximation 

U = nz 0 — {Zo) = -1^- n dlo 8 z / 9) 

°F(z 0 ) Zd/3~ nr ~dp~’ (9) 

in agreement with (14.25); hence 

Iog <§ p= "( IogZ ^Tr)- < 10 > 

Comparison with (14.25) shows that the entropy in this theory 
is to be defined by 

S=zns=: ilogJP, (11) 

while m the Appendix, 14, the definition was S = k log P, where 
means the maximum value of the probability. 

Thus it becomes clear that owing to the enormous sharpness of 
the maxmram it does not matter whether one averages over all 

STStTJi? 0U l°? y the 8tate of maximum probability. 

®d thL^f ,**’ “ ° f the mo8t P robable distribution 

^d that of mean values, do not differ as much as it appears. 

oth use asymptotic approximations for the combinatorial 
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quantities: either for each factorial in the probability before 
averaging, or for the resultant integral after averaging. The 
results are completely identical. Yet there are apostles and dis- 
ciples for each of the two dootrines who regard their creed as the 
only orthodox one. In my opinion it is just a question of training 
and practice which formalism is more convenient. The method 
of Darwin and Fowler has perhaps the advantage of greater 
flexibility. The partition function is nothing but a ‘generating 
function’ for the probabilities, and allows the representation of 
these by complex integrals. In this way the powerful methods 
of the theory of analytic functions of complex integrals can be 
utilized for thermodynamics. 

16. (VI. p. 56.) Boltzmann’s collision integral 

The collision integral (6.24) can be derived in the following way. 

The gas is supposed to be so 
diluted that only binary en- 
counters are to be taken into 
account. Then the relative 
motion of two colliding particles 
has an initial and a final straight 
line asymptote. 

To specify an encounter we 
define the ‘cross-section’ as the 
plane through a point 0 with a 
normal parallel to the relative velocity ?i— of two particles 
before an encounter and introduce the position vector b in this 
plane. We erect a cylindrical volume element over the area db 
with the height |£i— ? 2 | dt\ then all particles in this element 
having the relative velocity ? 2 will pass through db in time 
dt. The probability of a particle 2 passing a particle 1 at O 
within the cross-section element db is obtained from the 
product /(l) dx 1 d| 1 /(2) dx 2 d? 2 by replacing dx 2 by the volume 
of the cylinder |5 X — 1* 2 | dbdt, 

/(l)/(2)l5i-? a | dbdx^^dt. 

Every encounter changes the velocities and removes therefore 
the particle 1 from the initial range. The total loss is obtained 
by integrating over all db and d§ 2 : 

dx^de JJ7(i)/(2) |5 X — ? 2 |dbd| 2 . 



(i) 
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But there are other encounters such that the final state of the 
particle 1 is in the element they are called inverse en- 

counters. 

If the final velocities of the direct encounters are %' 2 the 
laws of collision (conservation of momentum and energy) allow 
one to express ?i, % 2 in terms of § 1# and two further parameters 
(the components of b in the cross-section plane). These relations 
are linear in § l9 and may be shortly written 

(§u == (2) 

where & represents a 6 X 6 matrix. It is obvious that the solu- 
tions of these equations for and § 2 in terms of and % 2 must 
have the same form; that means that JS?-* 1 = JS?, so that JS? 2 = 1 
and \&\ = 1 or 

(3) 

Further, the elementary theory of collisions (conservation of 
energy and momentum) implies 

16-SI - 16-61- (*) 

Hence the number of inverse encounters is 

dx^dt JJ /'(l)/'(2)fo-s a | dbd% 2 , (5) 

where /'( 1) means f(t, x x , % x ), being the linear function of 
?i»?a given by (2). 

Combining (1) and (5), one obtains for the total gain of par- 
ticles (1) in dx 1 d^ 1 , per time-element dt, 

dx^dt // {/'(l)/'(2)-/(l)/(2)}|| 1 -? a | dbd? a . (6) 

This has to be equated to the change of /(l) calculated without 
assuming interactions, namely, 

y®dx 1 as l dt=l°M-[E,m]}ix t ds l it. (7) 

The results are the combined formulae (6.23) and (6.24) of the 
text, 


(S) 
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17. (VI. p. 57.) Irreversibility in gases 
Assuming no external forces, the Hamiltonian of a particle 
is H = - |-p a , hence Boltzmann’s equation (6.23), or (Appendix, 
16.8) reduces to 

= /J {/'(l)/'(2)-/(l)/(2)}|5 1 -? 8 |dbd| 2 . 

( 1 ) 

If now the entropy is defined by (6.25) or, using the velocity 
instead of the momentum, by 

^ /(iJiog/C 1 ) (2) 

one obtains 

and substituting 8f(l)jdt from (1) 

-*//// {l+log/(l)}{/'(l)/'(2)-/(l)/(2)}x 

Xl^-Saldbdx^A (3) 
Here the first integral can be written 

jjSi-^ifWogfWdx^ 

and transformed, by Gauss’s theorem, into a surface integral 
over the walls of the container, 

JdaJ ^.v/Wlog f(l)c% v 

where v is the unit vector parallel to the outer normal of the 
surface, da the surface element. 

The inner integral is n times the mean value over all velocities 
of ?i.vlog/(l), where n = J /(l) is the number density. But 
this average vanishes at the surface which is supposed to be 
perfectly elastic and at rest, external interference being excluded; 
for the numbers of incident and reflected particles with the same 
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absolute value of the normal component of the velocity, |?.v|, 
will be equal. 

Hence there remains only the second integral in (3). This can 
be written in four different forms, namely, apart from the one 
given in (3), where the factor 1+/(1) appears, three others where 
this factor is replaced by l+/(2) or 1+/'(1) or l+/'(2). Foritis 
obvious that 1 and 2 can be interchanged as the integration is 
extended over both points in a symmetric way; and the dashed 
variables can be exchanged with the undashed ones as 

lfi-51 = I5i-5«l 

(see Appendix, 16.3, 4). Hence 

W = JJJJ { 1 °g/( 1 )+ 1 °g/(2)-log/'(l)-log/'(2)}X 

X{/'(l)/'(2)-/(l)/(2)}|5 1 -5 2 |ibix 1 ^ 1 d? a 

or 

§ “ I//// 


(4) 

l0g 7(l)/(2 2 ) ) i8 P ositive or negative according as /'(l)/'(2) 

ia greater or smaller than /(l)/(2); it has therefore always the 
same sjgn as /'(l)/'(2)-/(l)/(2) ; and one obtains 

dS 

the = sign can hold only if 

/'(!)/'( 2) = /(l)/(2) (6) 

°r 1 °g/'( 1 )+log/'(2) = log/(l)+l og /(2). (7) 

One can express this also by saying that 

lo g/(l)+log/(2) (8) 

is a collision invariant. 

The mechanics of the two-body problem teaches that there axe 
only tour quantities conserved at a collision: the three com- 

mo f entum and the total energy 

Hence log / must be a linear combination of these: 

log / = • 5, (9) 

or / = e“ _ t0 m £*+yi. 
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This can also be •written 

f = 


where u = — y and a, = a- 
mj3* ^ 


( 10 ) 


2mjS* 


(10) shows that u is the mean velocity. For a gas at rest (in a 
fixed vessel) one has therefore u = 0, hence y = 0 and 

f = e-*, « = M 2 - (11) 

This is the dynamical proof of Maxwell’s distribution law. 

18. (VI. p. 60.) Formalism of statistical mechanics 

As said in the text, Gibbs’s statistical mechanics is formally 
identical with Boltzmann’s theory of gases if the actual gas is 
replaced by a virtual assembly of copies of the system under 
consideration. Hence all formulae referring to averages per 
particle (small letters) can be taken over if the word ‘particle’ 
is replaced by ‘system under consideration’. One forms the 
partition function (14.24) or (15.8) F(z) = Z{fi), z = e~&, and 
from that the free energy (14.27) 

f=-hT\ozZ, ( 1 ) 

from which all thermodynamical quantities can be obtained 
by differentiation: 

a — — — f = (2) 

0T’ Ia da„ V ' 

This formalism includes also the case of chemical mixtures 
where the number of particles of a certain type is variable. One 
has to know how the quantity Z depends on these numbers; 
then the chemical potentials, introduced in Appendix, 8, are 
obtained by differentiating f with respect to the concentrations. 
We shall mention only the method of the ‘great ensemble* which 
can be used in this case. 

In the theory of non-ideal gases the Hamiltonian splits up 
into a sum 

, N 

(3) 


and the partition function into a product 

Z = J J exp | — (]3/2m)^pj|j dp v ..dp N X 

X J Jexp{-jBC7(x ll ... > x iV )}dx l ..^Xy. 
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The first integral can easily be evaluated and gives 

== (277?ni feT) 8 "? 2 ; 

hence one has 
where 

Q = f ( Wf exp{— U(x v ...,x N )/kT} dx...d^ N , (5) 

as in (6.35) of the text. 

The method of Ursell for the evaluation of this integral applies 
to the case where the potential energy is supposed to consist of 
interaction in pairs between the centres of the particles, 


U = YO 




Then one can write 

e-JStf = 


% = O(^). 

'-IK 1 -/#). 


( 6 ) 


(7) 

( 8 ) 

(9) 


S' 

where f i} = i_ e -j3®«. 

The product (7) can be expanded into a series 
e ' PU= 

and the problem of calculating Q is reduced to finding the 
‘cluster integrals’ 

j...ff ij dx 1 ...dx N , [:.j f ij f hk dx v ..dx N , (10) 

which are obviously proportional to F^- 1 , V N ~ 2 Hence one 

obtains for QV~ N an expansion in powers of V~ x which holds 
for small interactions small implies/^ small): 

« = mi 

Then (1) and (4) give 

/= —icT l 0 g(27rmiT)+logQj, (12) 

hence 

p=-V-kT dl °Z Q - NkT ll A , B ) 

* 8V- ,cl ^v~ = ~[ 1 -v + W~"j i (13) 

where A = oc/N, B = (a 2 — 2fi)/N 9 ... . That is the formula (6.36) 
given in the text. 

a actual evaluation of the cluster integrals is extremely 
difficult and cumbersome. The analytical properties of the power 
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series (11) have been carefully investigated by J. Mayer, and by 
myself in collaboration with K. Fuchs, The theory has been 
generalized so as to include quantum effects by TJhlenbeck, 
Kahn, de Boer. Here is a list of publications: 

H. D. Ursell, Proc. Camb . Phil. Soc. 23, p. 685 (1927). 

J. E. Mayer, J. Chem. Phys. 5, p. 67 (1937). 

J . E. Mayer and P. J. Ackermann, ibid. p. 74. 

J. E. Mayer and S. F. Harrison, ibid. 6, pp. 87, 101 (1938). 

M. Bom, Physica, 4, p. 1034 (1937). 

M. Bom and K. Fuchs, Proc . Boy. Soc. A, 166, p. 391 (1938). 

K. Fuchs, ibid. A, 179, p. 340 (1942). 

B. Kahn and G. E. TJhlenbeck, Physica , 4, p. 299 (1938). 

B. Kahn, The Theory of the Equation of State. Utrecht Dissertation. 

J. de Boer and A. Michels, Physica , 6, p. 97 (1939). 

S. F. Streeter and J. E. Mayer, J. Chem. Phys . 7, p. 1025 (1939). 

J. E. Mayer and E. W. Montroll, ibid. 9, p. 626 (1941). 

J. E. Mayer and M. Goeppert-Mayer, Statistical Mechanics , J. Wiley 
& Spns, New York (1940). 

J. E. Mayer, J. Chem . Phys . 10, p. 629 (1942). 

W. G. MacMillan and J. E. Mayer, ibid. 13, p. 276 (1945). 

J. E. Mayer, ibid. 43, p. 71 (1939) ; 15, p. 187 (1947). 

H. S. Green, Proc. Roy. Soc . A, 189, p. 103 (1947). 

J. de Boor and A. Michels, Physica , 7, p. 369 (1940). 

J . do Boer, Contributions to the Theory of Compressed Oases. Amster- 
dam Dissertation (1940). 

J . Yvon, Actuality sdentifiques et industrielles, 203, p. 1 (1935) ; p. 542 
(1937); Cahiers de physique, 28, p. 1 (1945). 

19. (VI. p. 62.) Quasi -periodicity 

The state of a mechanical system can be represented by a 
point in the 6A r -dimensional phase space p, q, and its motion by 
a single orbit on a ‘surface 9 of constant energy in this space. 
FoU owing this orbit one must come very near to the initial point; 
the time needed will be considerable, in the range of observa- 
bility. This is the quasi-period considered by Zermelo. Yet 
there are much smaller quasi-periods if one takes into account 
that all particles are equal and indistinguishable; the gas is 
already in almost the same state as the initial one if any particle 
has come near the initial position of any other. Then the orbit 
defined above is not closed at all, yet the system has performed 
another kind of quasi-period. These periods are presumably 
small; I cannot give a mathematical proof, but it seems evident 
from the overwhelming probability of distributions near the 
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most probable one. Einstein has this quasi-period in mind It 
is certainly extremely short in the scale of observable time- 
intervals, and one can therefore say that the representing ‘point’ 
sweeps over the whole energy surface if this point is defined 
without regard to the individuality of the particles (i.e. if an 

TT 0 ™ ^ um ^ )er single points corresponding to permutation 
ot all particles are regarded as one point). 


20. (VI. p. 63.) Fluctuations and Bro wnian motion 

The statistical conception of matter in bulk implies that 
spontaneous deviations from equilibrium are possible. There are 
several different types of problems, some of them concerned 
with the deviations from the average or fluctuations found by 
repeated observations, others with actual motion of suspended 
visible particles — the Brownian motion. 

The simplest case of fluctuations is that of density, i.e. of the 
number of particles in a small part coV of the whole volume V. 
One has in this case two cells of relative size o> and 1 -a>, and the 
probabihty of a distribution n x ^ = n-n x , is according to 
(6.15) or (14.1) & 


F(%) “ n x \ (n-n x )l (1) 

The expectation value of n ± found by repeated experiments is 

555 2 ^ oi ni (l— 


ni—o 


% K- 1) ! {(»- /)- K-lf! wWl 1(1 - a> ) (n - 1Mni_1) - 

According to the binomial theorem this reduces to 


. , x , n 1 = nu>, (2) 

as might be expected. ' 

In order to calculate ^ we note that S^fa-l) can be found 
exactly the same way as n^, namely, 

%(«!—!) = 2 


in 


(n—2)\ 


— n{n— l)w a V ■ 

*=& (%~2)!{(?i— : 2)— («!_ 2)}! X 

x a) ni “ 2 (l — C0 )(n- 2 Hni-2) j 
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whence ^i(%— 1) = n{n— 1)«®. (3) 

Therefore 

wf = %(%— l)+«i = n(n— l)aj 2 +»a>, (4) 

so that the mean square deviation is 

(A»i) a = (%— %) a = (%) 2 = nw—nu>* = »o>(l— a>). (5) 

If oj is a small fraction, one obtains the well-known fluctuation 
formula for independent events 

(A»i) a = %. (6) 

This is directly applicable to the density fluctuation of an ideal 
gas and can be used to explain the scattering of light by a gas, 
as observed for instance in the blue of the sky (Lord Rayleigh; 
Atomic Physics , Appendix IV, p. 280). 

There are also fluctuations of other properties of a gas. As 
the state of a fluid is determined by two independent macro- 
scopic variables (p, V for instance), it suffices to calculate the 
fluctuation of one further quantity. The most convenient one 
is the energy. 

The following consideration holds, however, not only for ideal 
gases, but for any set of independent equal systems of given 
total (or mean) energy; it supposes only that the distribution 
is canonical. 

Then all averages can be obtained with the help of the par- 
tition function (14.24) or (15.8), 

Z(P) = 2 co r e-fa, P = -£p- ( 7 ) 

In particular the mean energy is (14.25) 


u = e = 


2<o r e r e~fa z , 

2 <o T e~fa Z 


dlogZ 

~dfT’ 


( 8 ) 


and the mean of its square 


2 oj T ele-fa 

.2 _ _r 

2 o> r e~fa 



( 9 ) 
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Hence the mean fluctuation of energy 

= £=!)* = ?_ (i ). = 

ZZ'—Z'* 


or with (8) 


2 2 


= -(^\ 
dP \Z}’ 


(Ae ) 2 = 

dp’ 


( 10 ) 


t«nce^ is known as a function of temperature, 

«££>£ fo™ °?T * to fluotaati ® if differentiation. For 

SL " «?<“»* — <h * - WDfH, hence 

tion of radiation ' . Another application, to the fluctua- 

on ot radiation, is made m section VIII p. 79. 

which n c a ^+ e l to f etermine the Nations of a part of a body 

— thJ 

apS'S ^‘™ ted * mo8 ‘ ‘■’SMiots method which can he 
ePPtodm such cases. It connate in reverring Boltanann'a 

3 = hlogP, (11) 

determining ™ «* — 

P = e«. (I2) 

£ t 1464 tato * tut stm 

them; then one hte for ftem^t'S 1 ^ 6 '* 3 ' “ °” e ° f 

If the whole system is adiabatically isolated one has 

^ A u t = 0. 

By adding up all fluctuations one gets for the entropy 

( 13 ) 

where the abbreviations 


l/a a iS 
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axe used. According to the second law of thermodynamics one 
has for constant volume 


_dU 

~ T 5 8U~ T* 


hence 


7 = 



2k8U\Tf 


1 M-Q 1 (IS) 

2 k dU ~ 2kT\’ {l6) 

8T 


where c v = dUjdT is the specific heat for constant volume. 
Substituting (13) in (12) one obtains approximately 

P = P 0 e~y 

hence the mean square fluctuation of energy in one (macroscopic) 
cell is 

_ /a*-* *6 • . „ 

(*tf - « - - ~ = J r* * = h 

{ -oo 7 

— 00 

or (S53* = 42%. (16) 

This result is formally identical with that for an ideal gas ob- 
tained above, yet holds also if c v is any function of T. 

In a similar way other fluctuations can be expressed in terms 
of macroscopic quantities. 

We now turn to the theory of Brownian motion which is also 
due to Einstein. His original papers on this subject are collected 
in a small volume Investigations on the Theory of the Brownian 
Movement , by R. Fiirth, translated by A. D. Cowper (Methuen 
& Co. Ltd., London, 1926) and make delightful reading. Here 
I give the main ideas of this theory in a slight modification 
formulated independently by Planck and Fokker. 

Let f(x, t) dx be the probability that the centre of a colloidal 
(visible) particle has an ^-coordinate between x and x-\-dt at 
time t. The particle may be subject to a constant force F and 
to the collisions of the surrounding molecules. The latter will 
produce a friction-like effect; if the particle is big compared with 
the molecules, its acceleration may be neglected and the velocity 
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component in the ^-direction assumed to be proportional to the 
f0r0e £ = BF, (17) 


where B is called the ‘mobility’ . Apart from this quasi-oontinu- 
ous action, the collisions will produce tiny irregular displace- 
ments which can be described by a statistical law, namely, by 
defining a function <f>(x) which represents the probability for a 
particle to be displaced in the positive ^-direction by x during a 
small but finite interval of time t. 

Then one obtains a kind of collision equation (which is m'rn pW 
than Boltzmann’s in the kinetic theory of gases, as no attempt 
is made to analyse the mechanism of collision in detail) : The con- 


vectiveinoreaseof/(a:,f)inthetime-intervalT,T— = r(~ 4-— 

dt \8t^dx ) 

is not zero but equal to the difference of the effect of the collisions 
which carry a particle from x 1 to x and those which remove the 
particle at x to any other place x t : 


fi-p \ 00 

« + teV" / {/W(z-* 1 )-/W(*i-aO} dx x 

' —CD 


oo 

= j {f(x-x')-f(x)}<f>{x')dx'. (18) 

—00 

#*) may be normalized to unity and the mean of the displace- 
ment and of its square introduced by 

? « „ 

J ^(*) dx = 1, J x<f>(x) dx = A®, f x *<f>(x) dx = (A®j®. 

60 — °0 * 

—00 

(19) 

Further it may be supposed that the range of <f>(x) is small* then 
Tm™77 f(X 7 X \° n the ri « ht - hajld sid e of (18) and obtain 

8,1 wUoh ’ wia < 17 > TO. •» 


f+(iSJ’_d2W_i<A*W . 

ot \ n- I As* O I ~ = V. 


( 20 ) 


T Jdx 2 r dx 2 
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Consider first statistical equilibrium; then 


.0/ 


,s 2 / 


BF%—^-(Ax)*¥,= 0. 

8x 2 t k ' 8x 2 


( 21 ) 


Now the coordinate a; of the colloidal particle can be included in 
the total set of coordinates of the whole system, if a term — Fx 
is added to the Hamiltonian, so that the canonical law of distri- 
bution contains the factor efi Fx ,fi = IjlcT. Hence the solution 
of (21) must have the form 


so that 


f = f 0 e? Fx , 

= RF—. 
8x 2 p 8x 


( 22 ) 


If this is substituted in (21), one finds 

D 




0 


(23) 


We consider now the mo'tion of the particles without an 
external field (F = 0), under the action of the collisions only. 


Then (20) reads 


8f = D 8 2 f 

8t 8x 2 ‘ 


(24) 


This is the well-known equation of diffusion. Einstein’s m».in 
result consists in the double formula (23) which connects the 
mean square displacement with the coefficient of diffusion!) and 
with temperature and mobility. 

If the particle is known to be at a given position, say x = 0, 
at t = 0, the probability of finding it at x after the time t is the 
following solution of (24): 


/(*. <) 


i p -x l l&JDt. 

V(4 -nDt) 


(25) 


the mean square of the coordinate, or the ‘spread’ of probability 
after the time t is found by a simple calculation: 


(Aa:(i)} 2 = J x z f(x, t)dx= 2 Dt, (26) 

—00 

which for t = t is equal to the mean square displacement (A*) 2 
given by (23). 

These formulae can be used in different ways to determine 
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Boltzmann’s constant k, or Avogadro’s number N = R/k (where 
R is the gas constant per mole), i.e. the number of molecules per 
name. A static method consists in observing the sedimentation 
under gravity of a colloid solution; then F = —mg, where m is 
the mass of the colloid particle, and the number of particles 
decreases with height according to the law (22), which now reads 
n = n 0 

In order to apply this formula one has to determine the mass. 
•For spherical particles m = (4^/3)^, where p is the density and 
r ^ radius The mobility of a sphere in a liquid of viscosity v 
has been calculated by Stokes from the hydrodynamics! equa- 
tions, with the result 

< 27 > 

S^cT?i 7 T der g "" ity ' * = 

£= —BF = -9£ r i' 

9 t] 

As f can be measured, r can be found, if p and » are known, and 
finally m. ' 

. Another method is a dynamical one. One observes the dis- 
placements Ax x , Ax 2 ,..,, of a single colloid parti cle in equal inter- 
vals t of time and forms the mean square (A*) 2 . Then using the 
same method as just described for determining the radius r, 
one finds B from (27) and then k from (23). 

In this way the first reliable determinations of N have been 
made. Among those who have developed the theory M. v. 
Smoluchowski has played a distinguished part, while the first 
systematic measurements are due to J. Perrin. 

A new and interesting approach to the theory of Brownian 
motion may be mentioned: J. G. Kirkwood, J. Chem. Phvs 
14, p. 180 (1946); 15, p. 72 (1947). V 

^function 67 ^ Reduction of to® multiple distribution 

tJS 6 tWfi of N particles can be split into two 

parts, toe first being the Hamiltonian H N _^ of IV- 1 particles 
the second the interaction of these with the last particle: 

— ^-i+2^P CA02 + <I)(N) + 2 (1) 
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where *» is the external potential on the particle * and W the 
mutual potential between two particles i and j. 

system ** aPPly ° perator to the equation for the total 

= i E N>fN]- ( 2 ) 

IVom (6.40) we have, for q = N—l, 

Xn/n = In- i* 

Hence 

x»f = |(x»A) = %, 


and 


XwPWw] = Xw[^v-i,///]+Xiv[( l/2m)pW2+0>W,/ JV ]+ 

Here the first term on the right-hand side becomes 

Xw[^2y-i,^v] = [-Htf-i) Xn/n] — [^kr-i»/iv-x]. 

smce H n _ x does not depend on the particle N to which the opera- 
tor xn refers. Further, 


*[£p»44»A,] = *£*£ . J&) - 0; 

for if the integration xn is performed, the result refers to values 
off„ at infinity of the xW and gw respectively, and these vanish 
as there is no probability for particles to be at an infinite distance 
or to have infinite velocities. 

If all this is substituted in (2) one obtains 


1=1 


( 3 ) 


Repeating the same process with xn-i> Xn-z,- one obtains the 
chain of equations (6.44), (6.45) of the text. 


22. (VI. p. 68.) Construction of the multiple distribution 
function 

The fundamental multiplication theorem for non-independent 
events can be obtained in the following way. 

Any event of a given set may have a certain property A or 
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not, A. If £ is another property -we indicate by A B those events 
whioh have both the properties A and B. 

_Then all events can be split into four groups AB, AB, AB, 
AB, -with the probabilities p AB , pjg, p AS , pjp. 

The probability of A is 

Pa = Pab+Pab- W 

On the other hand, if A is known to occur, the cases AB, AB are 
excluded, hence the probability of B is 


Pb{A) = 


Pab 

Pab~\~Pab 


_ Pab 
Pa * 


OT PaPb(A) = Pab, (2) 

which is the multiplication rule; it reduces to the ordinary one 
for independent events if p s (A) does not depend on A and is 
equal to p B . 

This rule can be applied to a mechanical system of N particles 
in the following way. 

Let A signify that q particles are in given elements of phase 
space; the probability of A can be written 


p A =f a d^d^\..d^d^P. (3) 

Let B mean that the element q -\- 1 is occupied. Then AB ex- 
presses that all g-fl elements are occupied, or 

Pab = f a+ i . A^d%®di&+ 1 '>dE<*+n. (4) 

Hence p B {A), the probability for the element g+1 being occu- 
pied, if q particles are in given elements, is 


p B (A) = = Js+i ix<«+ 1 W«+ 1 ) . 

Pa J q 


( 5 ) 


If this is summed over all possible positions and velocities of the 
last particle (?+ 1), the result is equal to the number of particles 
excluding the q fixed ones, N — q\ hence, with the normalization 
described in the text, (6.42) and (6.43), 


(N-q)f a = JJ/ ff+1 dxt«+i)d|to+W = W a+ i, (6) 

which is the formula (6.40) of the text. 

In order to construct the equation (6.44) for the rate of change 
of fq> one has to introduce a generalized distribution function 
which depends not only on the position x and the velocity § 
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but also on the acceleration kj of the particles; the probability 
for a set of q particles to be in the element 


shall be denoted by 


g t (t, x®, %<■», IQ®..., x<a), §M>, yj(a)) dxWd^fyW . 

One has obviously 

/ -?! / g a dtf>...dti^ = U (7) 

Now the motion of the molecules follows causal laws; hence 
the probability f a of a configuration in x, §-space at a time t must 
be the same as that at the time t- f- 8 t of that configuration whioh 
is obtained from the first by substituting x«-|-5 w Sf and |«>-j-v]<®8f 
for x (< > and 1 gw. 

Hence (7) leads to 

J ® f g«(t+ K x+% St, ?+Y] St, 7j) drfU...drf<ti 

= /<?>/ g a (t,*,%,r l )drP...dy{<‘\ 

or / ~ j m 

The integration in the first two terms can be performed with 
the help of (7); that of the last leads to the integral 

(9) 

where the symbol rP is evidently the mean acceleration. 

Hence one obtains from (8) 



The final step consists in using the laws of mechanics for deter- 
mining rP. The equations of motion are (force P< r >) 


«<>■) — 1 / y 8Q<*> 

1 m\A, exW 


■P «1 


(11) 


Now the function f q refers to the case where the positions and 
velocities of q particles are given, the others unknown. Hence 
one has to split the sum (11) into two parts, the first referring to 
the given particles, the second to the rest. For this rest the 
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probability of finding a particle in a given element q -\- 1 is known, 
namely (/ fl+ i// a ) dx^ s+1 5^ a+1) ; hence the average of this sum can 
be determined by integrating over dxta+«<fl-<«+« i.e. by applying 
the operator Xj+v In this way the mean acceleration is found to be 


^ = L y f \ ll2) 

m 8x<<) m /«* 9 \ 2 x 5 ^ * q+r ) ' ' ' 

Substituting this in (10), one obtains 




+ 


_ /a<D<^+b 

+ mA 1 X2+1 \ Sx«> 



whichis easily confirmed to be identical with the formulae (6.44), 
(6.45) of the text. 


23. (VI. p. 69.) Derivation of the collision integral from the 
general theory of fluids 

iProm the standpoint of statistical theory a fluid differs from 
a solid by the absence of a long-range order, so that for two events 
A and B happening a long distance apart one has, with the 
Appendix, 22, p AB = p A .p B ; fo r instance, for large 
IxO-x^l one has/ a (x«x«))=/ 1 ( X (i))/ 1 (x(2)) j while in solids 
this is not the case. 

The distinction between liquid and gas is not so sharp and 
may even be said to disappear above the critical state. However, 
rf one is not specially concerned with these intermediate con- 
ditions there is a wide region where liquid and gas can be dis- 
tinguished by the extreme difference of density. From the 
atomistic standpoint this has to be formulated thus: 

JPVS"?® 1 6nerg7 < * )(x ®’ x ®) between two molecules at 
* and xW decreases rapidly with the distance between their 
mass centres, and (except in the case of ions, where Coulomb 
forces act) a distance r 0 , small by macroscopic standards, may 
be specified, beyond which the interaction may without error 
be assumed to vanish completely. In a liquid proper, there are 
many molecules within this distance r 0 of a given molecule; in a 
gas there are usually none, and the probability that there is more 
than one is very small, except near condensation. The neglect 
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of this small probability is equivalent to the assumption of 
‘binary encounters’ in gas-theory. Green has shown that when 
this assumption is made, on t aking q = 1 in the equations (6.44) 
and (6.46) of the text, 


a) 

^ = ( 2 ) 


one obtains Boltzmann’s collision equations (6.46), (6.47). 

To prove this we first work out the expression S 1 using the 
definition (6.6) of the Poisson bracket and of the operator v, 
(6.39): 






(3) 


(see also 22.13). With the assumption of binary encounters / 2 
can be expressed in terms of A by using the mechanical laws of 
collision. 

Consider the motion of two molecules which at time t have 
positions x®, x (2) , such that |x®— x®| < r 0 , and velocities 
5®, ^( 2 ) > w hile at time £ 0 (< t), when the molecules were last at a 
distance r 0 from another, their positions and velocities were 
x&», xj, 2 > and ^ 1) ,^ 2) . The configurational probability 

fS, x®, x®, 5®, §<») dx®dx®d5®d5® 
must remain unchanged during the interval (t 0 , t) as the motion 
follows a causal law; also, by Liouville’s theorem, the volume in 
phase space dx®dx®d£®d]j® is unaltered. Since, as explained 
above, molecular events in fluids which occur beyond the range 
of interaction must be considered independent, one has 

m x (1) , x®, 5», 5®) = A(« 0 , 4», x«), $*>). (4) 

Next one introduces an approximate assumption which is 
always made in gas-theory, that t 0 , x§\ x(, a) may be replaced by 
t, x® and x® on the right-hand side of (4) (but of course not 
?i x) > by 1-®, ^®). As r 0 is very small the resulting error is of 
microscopic order; nevertheless it is not without importance, 
for it allows small deviations from Maxwell’s velocity distribu- 
tion law (and other ‘fluctuations’), which would otherwise be 
unexplainable, as this law is a rigorous consequence of Boltz- 
mann’s collision equation in equilibrium conditions. 

It remains to calculate and ^ 2) in terms of % (1 \ §® and 
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r = x® — x®, which can be done by using the canonical equati ons 
of motion or their independent integrals (conservation of energy, 
momentum, and angular momentum). The resulting formulae 
are the same as used in Boltzmann’s theory (see Appendix, 16). 
The reduction of S 2 can he performed without making use of 
explicit expressions. One has only to remark that 

Ml**, *»,?». ?») 

now satisfies the equation 

[S 2 ,A] = 0, (5) 

where #2 = ?(5 (1)2 +5 (2)2 )+$(r) (6) 

2 

is the Hamiltonian of the two particles which are considered to 
move independently of all the others. Now (6) becomes 

m (£(2)_P0D) 3/2 00 /jfo (7) 

^ ' Sr dr [d%*> SI;®/ (l) 

We integrate this over then the term with 3/ 2 /S5 (2) on 

the right-hand side vanishes, because there are no particles with 
infinite velocities. The other term, with S/ 2 /S!;®, becomes identi- 
cal with m&y, according to (3), since 

SO _ SO 

Sr Sx®' 

Hence, with (4), 

Si = JJ (? (a) - 5 w ). |{/i& 1 ) )A(& 2) )} drd^\ ( 8 ) 

where the domain of integration over r may be limited by the 
sphere of radius r 0 surrounding x (1) . 

This integration can be performed by imag inin g the sphere to 
be partitioned by elementary tubes parallel to the relative 
velocity g (2) — £ (1 >; 0 ne may then integrate, first over a typical 
tube specified by the cross-section radius b, perpendicular from 
the centre of the sphere to the tube (see Appendix, 1 6), and then 
over all values of b. At the beginning of the tube, where 

(5 (2 M (1) ).r < 0 , 

the interaction between the molecules is negligible, and the 
functions giving and £j> 2) in terms of gw £< 2 >, and r reduce to 
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J- (W and 5 <2) . At the end of the tube the values and % < - 2y of these 
functions have to be calculated from the collision integrals, just 
as in Bol tzmann ’s theory. Thus one obtains 

^ = JJ l? (2) -? (1) l{/i(? (1) ')A(? (2) ')-/i(? (1) )A(r ) )} dbc%» (9) 

which is identical with (6.47) of the text and the collision integral 
in (16.8). 

This derivation is not more complicated than Boltzmann’s 
original one and is preferable because it reveals clearly the 
assumptions made. 


24. (VII. p. 72.) Irreversibility in fluids 

A rigorous proof of the irreversibility in dense matter from the 
classical standpoint seems to be very difficult, or at least ex- 
tremely tedious. Green has, however, suggested a derivation 
which, though not quite rigorous, is plausible enough and 
certainly based on reasonable approximations. 

It has to be shown that the entropy S defined by (7.1) never 
decreases in time, so that 

^ J j (l +log/ iV ) & dxVd%v...dxWdZ,™ ^ 0, 

(l) 


if f N satisfies the equation 

% = [H N ,f N ]+S N (f N+ i), (2) 


which expresses that one particle of unknown position and 
velocity is added to a system of N particles. 

If $ is the total potential energy between the N particles and 
that between the ith of these and the additional particle, 
one has 



0x«> 



Qfs 80 
0§<« ‘ 0xW 


+ 



( 3 ) 


If flhia is substituted in (1) the integrals of the first two terms 
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vanish on transformation to surface integrals. In the last sum 
all terms contribute the same, as f N and f N+1 can be assumed to 
he symmetric in regard to all particles. Hence 

T— artf"* 0 f <>+**&* 

df„ aoouv+n 

Now the reasoning follows very closely that of Appendix, 23, 
where (23.3) was transformed into the integrable expression 
(23.8) with the help of the identity ( 23.7). 

For this purpose one introduces instead of the velocities of the 
two particles (1) and (17+1) appearing explicitly in (4) new 
variables, namely their total momentum m, two components 
of the relative angular momentum a, and the relative energy w, 

m = m (5 w +? (W+1) ), a = — x (1) ) a (JjW- 1 ) J 

w = |(W|8+0(|xCN4-w_ x(W|), 

(^) 

and regards f N+1 as a function of these, so that 

fit+i = /W*» x (iV +W ) % ( -- Ar> , m, a, w). 

Then by direct differentiation it can be verified that 

an equation similar to (23.7). 


rf Sf N+1 8^>0.M+i) _ 

ag» ' 


/ 

gxd) ^ ^ a ^ en fr om it and substituted in (4), the 

only term which does not vanish is found to be 

dS 1c c r 

It = (FZIJi J J (i+iog f N ) x 

(8) 

h^csrr^-r^ the tra i eot ° ries ^ C h would 

betofiowedby the particles numbered (1) and (17+1) if no other 

In in AorentoT^' °“ 6 ^ apply the same reas oning 
as in Appendix, 23, partitioning the xW+W-domain by tubes 

formed by the trajectories of (17+1) relative to (l)f whei 
’ , 816 COnstailt > and one perform the integration with 
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respect to x^ +1 > first along such a tube, then over all values of 
the cross-section b. At each end of the trajectory where the 
interaction « can be neglected, the function f N+1 would 
factorize into provided no other particle were near to 

the particle (i^+ 1). 

This is, of course, not the case; but it seems to be reasonable to 
assume that the factorization is at least approximately correct 
as the action of the rest will nearly cancel. This is the simplifica- 
tion made by Green. It is clear that it could be corrected by a 
more detailed consideration; but let us be content with it. 

Since the sphere around x^ in which is effectively 

different from zero is of microscopic dimensions, the values of 
xW- 1 ) and x (1) need not be distinguished, nor the instants when 
these points are reached. The initial velocities ?»>', ptf+i)' mus t, 
however, be determined from the actual final velocities from the 
‘conservation 5 law, i.e. the de fini tions (5) for constant m , a w: 

m{%»' +%&+»') = m (5(D + |Wi)), 

m(xP> A #+» a gov+i )') 

= m(x^) A |W+ a % (N+l) ), 
^(^2+^2) = £ m (£< X)2 + %N+l)2) m 

Tf the integration in (5) is performed as described, one obtains 

J (2 " t2> J ^+ lo sfMA N+1) -M N+lv )x 

x |^V+1)_^(1)| dbdxdxW...d-x<Wd%u . (7) 

where instead of x°> the centre x = |(x a >+x< iV + 1 >) is introduced. 
Here/( iV+1 > means ? (Ar+1) ) which can he replaced, accord- 

ing to formula (6.40) of the text, by 

J J / 2 (X^+», X<^+ 2 >, ?W+2)) dxW+S^+2). 

If this is introduced into (7) one has an integral over 22^+4 
variables, where the integrand contains the factor f N f a —f N f a . 
By repeating the procedure one can transform (7) into the 
expression 

W = l(N-i)\}* / ( - y J ( l + l °sfMF N -f N r N )x 

x dbdxdx®. ..dxWdx<- N +v. ..dx^d^flE ,®. . .d^W, 
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where F N is the function, obtained by replacing the variables 
x® and I® in f N by x <t ' +w and respectively. 

Now one can apply the same transformations as for gases, 
as explained in Appendix, 17, which lead from (17.3) to (17.4), 
exchanging the dashed and undashed variables, and exchanging 

the two groups (1,2 N) and (1+iV, 2+ A",..., 2N). As it is 

obvious that the integral is invariant for these changes, one 
obtains 


§ = ® * 

X |?W+1)_|0)| dbdxdx< 2 >...d^dx< N + 2 L.d^^>...d^(^, (8) 

which mates it clear that dS/dt is positive or zero, and that the 
latter happens only if 

= (®) 
The solution of this equation leads again essentially to the 
canonical distribution. I shall, however, not reproduce the 
derivation but refer the reader to the original papers: 

M. Bom and H. S. Green, Nature, 159, pp. 261, 738 (1947). 

Proe. Roy. Soc. A, 188, p. 10 (1946) ,- 190, p. 466 (1947) ; 

191, p. 168 (1947); 192, p. 166 (1948). 

H. S. Green, ibid. 189, p. 103 (1947); 194, p. 244 (1948). 

The reader may compare this involved and, in spite of the 
complication, not quite rigorous derivation from classical theory 
with the simple and straightforward proof from quantum theory 
given in section IX. 

I wish to add an argument, also due to Green, which shows 
that once the increase of entropy is secured the distribution 
approaches the canonical one. The latter is given by 


/.v = e a ~P E , a = -4, R = ± 
kT’ p kT 

A is the free energy and E the energy, given by 

E = im^(|®— u®) a +£ 2<E>®, 

u® being the macroscopic velocity at the point x® 
Let the actual distribution be 


( 10 ) 

( 11 ) 


ft* — /Sr+/w! 


( 12 ) 
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one has 

jjf N dxd% = Nl, jjf N Edxd% = AT! 17, 

where U is the internal energy, and the same holds, of course, for 
f%, so that 

jjf N dxd% = 0, = 0. (13) 

Then 

/ivlog/tf = (/^+/W)log{/^l+^)} 

- Al°g/k+/*(log/k+l)+ ||+.- (14) 

Hence 

5 = “Fi JJ {/^ 1< >gA+A( a -^+l)+^+...} dxdt 


Here the terms linear in f' N vanish in virtue of (13), and one 


obtains 




(15) 


This shows that an increase in the value of S requires a decrease 
in the average value of |/^| and therefore an approach to the 
canonical distribution. 


25. (VIII. p. 75.) Atomic physics 
It seems impossible to supplement this and the following 
sections, which deal with atomic physics in general, by appen- 
dixes in the same way as before. The reader must consult 
the literature; he will find a condensed account of these things 
in my own book Atomic Physics (Blackie & Son, Glasgow; 
4th edition 1948), which is constructed in a similar way to the 
present lectures; the text uses very little mathematics, while a 
series of appendixes contain short and rigorous proofs of the 
theorems used. For instance, Einstein’s law of the equivalence 
of mass and energy is dealt with in Chapter III, § 2, p. 62, 
and a short derivation of the formula e = me 2 given in A. Ph . 
Appendix VII, p. 288. Whenever in the following sections I 
wish to direot the reader to a section or appendix of my other 
book, an abbreviation like (A. Ph . Ch. III. 2, p. 52; A. VII, 
p. 288) is used. 
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26. (Vm. p. 77.) The law of equipartition 
If the Hamiltonian has the form (8.5), or 

E = ,+H', « = ?p, (I) 

where S' does not depend on & one has for the average of c in a 
canonical assembly 

J ee-P s dpdq j* ee-P'dg j 

€ "" / ^dpdq ~ fe-P'dg ’ ^ = IcT’ 

as ah other integrations in numerator and denominator cancel. 
Now this can be written 


i= -Tp l °% Z ’ Z = S^di. ( 2 ) 

— 00 

If the integration variable r, = ^a/2)$ is introduced one gets 

Z = pr*A, 

where A is a constant. Hence log 2 = const. -£log/J and 

i = ^ = i kT ’ (3) 

in agreement with (8.6). 

27 J ; ^ p - 9 !-> O*™** calculus to quantum mechanics 
of matnx mechanics to deal with aperiodic motions 

^tead of matnces is contained in a paper by M Born 

general operators in 

1 * wnere m Particular operators of the fonn 

r 

5? /««. «)■••* 

'"re Hun the kernel gfteTi. . 'continucm mate-, alee 
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introduced by Dirac. This paper contains also the representation 
of special quantities by differential operators (with respect to 
time) which satisfy identically the commutation law between 
energy and time Et—tE = iH. 

SchrOdinger’ s discovery, which was made quite independently, 
consists in using a representation where the coordinates are 
multiplication operators and the momenta differential operators, 
so that the commutation laws 

9*Pl3-Pt}q* = MKp 

are identically satisfied. This opened the way to finding the 
relation between matrix mechanics and wave mechanics and 
to the later development of the general transformation theory of 
quantum mechanics which is brilliantly represented in Dirac’s 
famous book. 

The early development of quantum mechanics as represented 
in text-books has become rather legendary. To mention a few 
instances: the matrices and the commutation law = 1 
which are traditionally called Heisenberg’s, are not explicitly 
contained in his first publication: W. Heisenberg, Z. f. Phys. 

33, 879 (1925); his formulae correspond only to the diagonal 
terms of the commutator. The complete formulae in matrix 
notation are in the paper by M. Bom and P. Jordan, Z.f. Phys. 

34, 858 (1925). Further, the perturbation theory of quantum 
mechanics, traditionally called SchrOdinger’s, is contained 
already in the next publication of Heisenberg, Jordan, and 
myself (quoted at the beginning), not only for matrices, but also 
for vectors on which these matrices operate, and not only for 
simple eigenvalues, but also degenerate systems. The only 
difference of SchrOdinger’s derivation is that he starts from a 
representation with continuous wave functions which he aban- 
dons at once in favour of a discontinuous one (by a Fourier 
transformation). 

28. (IX. p. 94.) General formulation of the uncertainty 

principle 

The derivation of the most general form of the uncertainty 
principle can be found in my book (A. Ph. A. XXII, p. 326). 
As it is fundamental for the reasoning in these lectures, I shall 
give it here in a little more abstract form. 
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operator ° = A+iB md its 
real a£d no^ a^afeaf ™ Jue rf th * P 0 *** 00* “ 

5C*>0. (l) 

SZ ^t 1 ^ 08 v “T of Jine “ u™r.ging, aa d«aribed 

U+iXB)(A^iXB) = AS+m*-rf^B ]A > 0, (2) 

where the abbreviation ( 9 . 4 ) 

[A,B] = 1{AB~BA) (3) 

^ ’ff '! 1 " 14 *“* of ( 2 > la real and also the first tivo 

gZ “ f< ’“ 0WS K3I f ~i The mini 

Of the quadratic expressmn in A, given by (2), occuia when 


> = dggj 

2 £S 


and it is equal to 
Hence 


42 _| 2 [A£P 

4 52 • 


4 2 .j?2^^P7^2 


(4) 


unchanged. pJS* ( f ^ “““utato [4, B] remains 

si= { (5n«.. 

one obtains from ( 4 ) the formula (9.3) of th* text, 
and as [q,p\ = 1, especially (9.5), 


( 5 ) 


8p.8z>2. 


(6) 


This derivation reveals the simple algebraic root of tT.» 

temty relation. But it is net LXT,fS 

meaning of this mlstion for spe*S a^?^ £ 
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be found in A. Ph. A. XII, p. 296, A. XXXII, p. 367, and in 
many other books, for instance, Heisenberg, The Physical 
Principles of the Quantum Theory. 


29. (IX.p.97.) Dirac’s derivation of the Poisson brackets 
in quantum mechanics 

It is fashionable to-day to represent quantum mechanics in an 
axiomatic way without explaining why just these axioms have 
been chosen, justifying them only by the success. I think that 
no real understanding of the theory can be obtained in this way. 
One must follow to some degree the historical development and 
learn how thingB have actually happened. Now the decisive 
fact was the conviction held by theoretical physicists that many 
features of Hamiltonian mechanics must be right, in spite of the 
fundamentally different aspect of quantum theory. This con- 
viction was based on the surprising successes of Bohr’s principle 
of correspondence. In fact, the solution of the problem consisted 
in preserving the formalism of Hamiltonian mechanics as a whole 
with the only modification that the physical quantities are to be 
represented by non-commuting quantities. 

If this is accepted, there is a most elegant consideration of 
Dirac which leads in the shortest way to the rule for translating 
formulae of classical mechanics into quantum mechanics. It 
starts from the fact that classical mechanics can be condensed 
into the equation 

!-[«/] = o. <‘> 


which any function f{t,q,p) representing a quantity carried by 
the motion must satisfy. Here the Poisson bracket is used 



_a£ 8rj\ 

QPr 8 Qrl ’ 


(2) 


If (1) is to be generalized for non-commuting quantities, it is 
necessary to consider how the Poisson bracket should be trans- 
lated into the new language. 

Dirac uses the fact that these brackets have a series of formal 
properties, namely 

[f.^] = -fo.fi. M = °. 


(3) 
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where c is a constant ; further 

[fi+fa> *?] = [£n ij], 

[f^i+^a] = [L %]+[£ fja], 
and finally [fif 2 > 1 ?] [fi> ’jJfa'^fiEfai 

[£> ’ll 1 ?*] = [fi> ’?J%+ 7 ?x[^ 1 , ijJ. 




( 5 ) 


Here the factors are written in a definite order, though in 
classical mechanics this does not matter. We have to do so if we 
u f e these expressions for non-commuting quantities, 
and the rule followed is simply to leave the order of factors un- 
changed. 

The question is, What do the brackets mean in this case ? To see 

S’* 7 T* ! 7“ ke * [ f ‘ f* >)> Is] fa two ways, using the two 
formulae (5) first m one order, then in the opposite one. Then 

[£i fa> Vi ^a] = [fi> Vi ’Jajfa+fitfa* Vi ^a] 

= &i> V&2+VI&, ’jJJfa+fifl?,, Vihi+Vifo, Vz]} 

~ [£v Vilvsis+Viiiv i?a]fa+fi[fa» yjva+fi Vi[£z> Vz] 
and in the same way 

Vi Vz] = [&&, yihz+y 1 [f 1 £ 2 , V2 ] 

= &> Vi]& Vz+m, Vihz+Vifo, VzJiz+V! £i[£z> Vz]- 
•Equating these two expressions one obtains 

fo.’hl&’Ja-^a) = (fiy 1 ~y 1 f 1 )[? 2 , Vi ]. (6) 

As&> Vi] must be independent of£ 2 , V2 and vice versa, it follows 

fiVi—Vifi = A[fv Vi], 

fa Vz~ 7 lz$z = ^[fa> VtH, ^ 

wtoe Aisind^ndent of .Jlfcu* quantities and oommutes with 
beluiiwhHm • ,Va Hence A IS a number. Thatitmust 




( 8 ) 


is real. Thus it is established that the Poisson brackets in 
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quantum theory correspond to properly normalized commu- 
tators. 

If one inserts in the classical expression (2) for | and tj a 
coordinate or a momentum, one finds 

[fluffs] = \jPr>!Pa[ = == (9) 

where S„. = I, 8 ra = 0 for r ^ a. 

The same relations (9) must be postulated to hold in quantum 
mechanics. In this way the fundamental commutation laws are 
obtained. 

30. (IX. p. 100.) Perturbation theoryforthedensitymatrix 
We consider the problem of solving the equation 

| = [3,p], H = H 0 +V, (1) 

where the perturbation function F is small. 

The method is essentially the same as that used for the 
corresponding problem in matrix or wave mechanics. 

Assume that A represents the eigenvalues of a complete set 
of integrals A of H 0 , so that [H 0 , A] = 0 and H 0 becomes 
diagonal in the A-representation; put 

H 0 ( A, A) = E, H 0 ( A', A') = E', while H 0 (\, A') = 0 for A =£ A'. (2) 
Introduce instead of p and F the functions <x and U given by 
p(A,A') = a(A,A')e«»x E -^, 

F(A,A') = UiWeWW-w. ^ 

Then one has 

ftdpfa X) = (ft [ (jg_^> )(T (x,A' )Wxg-^ 

i dt dt j 

(H 0 p—pH 0 )(X, A') = (E-E')o{ 

Hence the equation (1) reduces to 
ftda^E) = (£7o ._ 0 . i7 j( A 
i dt 

= 2 {C7 (A, A')— ^A, A'J^A", A')}. (4) 

A' 

Now assume that o- is expanded in a series 

cr = cro+ 0 l+° , 2+'"> W 

o 


6131 



194 


APPENDIX 


where a 0 is diagonal and independent of the time and or x , cr 2 ,... 
of order 1, 2,... in the perturbation. Then one obtains 


hence 

where 


= * C7(A,A> 0 (A')-c 0 (A)}, 
<?i(A, A') == u(X, A'){er 0 (A') — cr 0 (A)}, 


t . 1 

«(A,A') = ij U(X, A') dt = | J F(A, X'^W-BVdt. (7) 

0 0 

It follows for the diagonal elements from (6) that 

°i(A, A) = 0. (8) 

The next approximation or 2 has to satisfy the equation 

8 -^~ = {U(A,A> 1 (A' ) A')-a 1 (A, X")U(y,X')} 

= i|[ C 7(A ) A>(A" ; A> 0 (A')-. 0 (A')}- 
—u(X, X")U(X“, A'){c 0 (A)— (7 0 (A")}] 

-«(A, A") — (g 0 (A)— g 0 (A <l )}j . 

We need only the diagonal elements; for these one has 

“ 1 2 U(X> A X A '> A ){ ff o(A')-ao(A)}, 

A' 

which gives by integration 

ff * (A > A) = | |w(A, A') |> 0 (A')-o 0 (A)}, (9) 

since, according to (7), w(A, A') is hermitian, u(X,X') = 

and vanishes for t == 0. K ' ' 

(3) “** tte dia * 0 “ I Momenta of p end <, are 
identioal, they represent the probability P(i,A) of finding the 
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system at time fin the state A. Now (5), (8), and (9) give, in agree- 
ment with (9.25) of the text, 


where 


P(t, A) = P(A)+ J J( A, A'){P(A')— jP(A)}+.„, (io) 

I * a 

J(A, A') = |«(A, A')! 8 = p J F(A, A .'Je-CWCM*# . (n) 


When F(A,A') is independent of the time one can perform the 
integration, with the result 


7(A,A') = ^|F(A,A')| 2 




(i/WE-E') 


( 12 ) 


1 

1 — fiyt 

_ sin^yf 

2n 

iy 

Try 


like a Dirac 8-funotion, i.e. one has 


J_ f I I — e*y* 
2w J j iy 

Ay 


/(y)dy-»/(0)f, 


if the interval of integration Ay includes y = 0 and if <Ay 1. 

Suppose that the energy values are distributed so closely that 
they are forming a practically continuous speotrum. Then one 
can split the index A into (A, E) and replace the simple summation 
in (10) by a summation over A and an integration over E; the 
latter can be performed on the coefficients J(X, E; A ',E') with 
the result that the formula (10) is unchanged, if the coefficients 
are given by 

J( A, A') = ^ |F(A, \')\*8(E-E'), (13) 


which combines the equations (9.27) and (9.28) of the text. 

As mentioned in the text, Green has found a formula which 
allows one to calculate the higher approximations in a very 
simple way. This formula is so elegant and useful that I shall 
give it here, though without proof (which can be found in the 
Appendix I, p. 178, of the paper by M. Bom and H. S. Green, 
Proc. Roy. Soc. A, 192, 166, 1948). Starting from the equation 
(7) or .• 


• * TT 

n = l U, 


(U) 
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where U is known for a given perturbation V by (3), one forms the 
successive commutators 


^22 = UU—UU, w as 5= U 22 U— UU 2 29 — , (15) 

and from them the expansion 


u 2 = 


1^2 

2 ! 




(16) 


If the initial condition u 2 = 0 for t = 0 is added, u t can be deter- 
mined by integrating this series term by term 
Then one forms 


— « 2 « a — V^Ui, W 38 = U^Ui— u 2 u M , 

and the expansion 

U g = jjg4 1 , 3m 8 8 I 

8 2! r 3! + 4! i "•" , 


(17) 

(18) 


from which one can determine u t so that = 0 for t = 0. 

The second su ffix l in has been chosen to indicate the power 
of V which is involved in the expression; one has u k = 0(F 2 *' 1 ) 
and this decreases rapidly with 1c when V or t are small. This rule 

makes it possible to construct u it in a similar manner. 

Then one has the solution of (4) 


or = e“e u *e“*...p 0 ...e-“»e-“«e-“ (19) 

from which p is obtained by (3). 

The explicit expressions for the expansion (5) of a are 
CT i = up 0 —p 0 u, 

= i(w 2 p 0 — 2 up 0 u+p 0 u z )+$(u 2 i p 0 — p 0 u i2 ), 

°s = Uu > p 0 -^u i p 0 u+Zup 0 u i -p 0 u s )+ 

+iM%a Po-Po «2 2 )~ («22 Po-Po «2>}+i(«28 P 0 ~Po «2s)r- • 

( 20 ) 

These formulae ■will be useful for many purposes in quantum 
theory. Concerning thermodynamics, the third-order terms will 
have a direct application to the theory of fluctuations and 
Brownian motion. The customary theory derives these pheno- 
mena from considerations about the probability of distributions 
in an assembly which differs from the most probable one. The 
theory described here deals with one single system with the 
methods of quantum mechanics (which allows anyhow only 
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statistical predictions) ; deviations from the average -will then 
depend on higher approximations. It can be hoped that this idea 
leads to a new approach to the theory of fluctuations in quantum 
mechanics. 

31. (IX. p. 112.) The functional equation of quantum 

statistics 

The equation (9.37), 

P(Em+E(*>) = P 1 (A<i))P a( A(2)) ) 

where A (1) depends on but not on E®\ and A (a) vice versa, is 
obviously of the form 

f(x+y) = <f>{x)<j>(y) 

treated in Appendix, 13, and has as solution general exponen- 
tial functions; hence the distribution for all three systems is 
canonical. 

32. (IX. p. 113.) Degeneration of gases 

The theory of gas degeneration is treated in my book Atomic 
Physics , but in a way which appears not to conform with the 
general principles of quantum statistics as explained in these 
lectures. According to these one always has in statistical equili- 
brium canonical distribution, P = e 0£ “ft E , while the presenta- 
tion in A. Ph. gives the impression that, by means of a modified 
method of statistical enumeration, a different result is obtained. 
This impression is only due to the terms used, which were those 
of the earlier authors (Bose, Einstein, Eermi, Sommerfeld), while 
in fact there is perfect agreement between the general theory 
and the application to gases. A simple and clear exposition of 
this subject is found in the little book by E. Schrfldinger, 
Statistical Thermodyriamics (Cambridge University Press, 1946). 
I shall give here a short outline of the theory. 

In classical theory an ideal gas is regarded as a system of inde- 
pendent particles. In quantum theory this is not permitted, 
because the particles are indistinguishable. If ^(x^) and 
^ 2 (x< 2 >), or shortly 0 X (1) and j/r 2 (2), are the wave functions of two 
identical particles with the energies E x and E 2 , the SchrOdinger 
equation for the system of both particles, without interaction, 
has obviously the solution ^(l)^^) "with the energy E x +E 2 ; 
but as the particles are identical there is another solution 
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belonging to the same energy, namely ^(2)^(1). Hence any 
linear combination of these is also a solution. Two of these, 
namely the symmetric one and the anti-symmetric one, 

U 1, 2) = ^lM a (2)+&(2)&(l), 

■£„(!, 2) = ^(1)^(2)-^(2)^(1) } 

have a special property: the squares of their moduli, |^ 8 |* and 
|^r a | 2 , are unchanged if the particles are interchanged. One can 
further show that they do not ‘combine’, i.e. the mixed inter- 
action integrals (matrix elements) vanish, 

J &(1, 2)Aip a (l, 2) dxWx® = 0, (2) 

for any operator A symmetric in the particles. Hence they repre- 
sent two entirely independent states of the system; each state 
bemg characterized by two energy-levels of the single particle 
occupied, without saying by which particle. 

The same holds for any number of particles. If E l 9 E 2 ,...,E n 
are the energies of the states of the isolated particles, the total 
system (without interaction) has not only the eigenfunction 
^i(l)^ 2 (2)«.^ n (ft) belonging to the energy E 1 +E 2 +...-\-E n but 
all functions P^r 1 (l)^r 2 (2)...^f n (7t), wherePmeans any permutation 
of the particles, hence also all linear combinations of these. 
There are in particular two combinations, the symmetric one 
and the antisymmetric one, 


&(1, 2,..., n) = | P^(1)^(2)...^( W ), \ 

^o( 1 »2,...,») = ^±Pt/r 1 (l)i/i i {2)...tft n (n) (3) 

(+ for even, — for odd permutations), , 


which have the same simple properties as described in the case 
of two particles; remains unaltered when two particles are 
exchanged, while *p a (which can be written as a determinant) 

IWW T gn; J 1 ? Ce J^I 2 and l*«l a remain unchanged. 
Fm^the two states do not combine, a feet expressed by 

, J5® ^ mCti °r s ** ^ h describe the state of the ^-particle 

SStwr ? 'S? ^ tHe Parti0les have lo8t ^eir indi- 

comts “ *» 

Erperim^t haa shotm that thia deaoripUon i. Equate for aU 
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particles in nature; every type of particle belongs either to one 
or the other of these two classes. 

The eigenfunctions of electrons belong, in view of spectroscopic 
and other evidence, to the antisymmetric type ; hence they vanish 
if two of the single eigenfunctions are identical, i.e. if two 
particles are in the same quantum state. This is the mathe- 
matical formulation of Pauli’s exclusion principle. Nucleons 
(neutrons or protons) and neutrinos are of the same type; one 
speaks of a Fermi-Dirac (FD.) gas. Photons and mesons, 
however, and many nuclei (containing an even number of 
nucleons) are of the other type, having symmetric eigen- 
functions; they form a Bose-Einstein (B.E.) gas. 

In both cases the total energy may be written 

E = Mi en+Jij e a -)- ... = 2 n s e s> W 

« 

where e v e a> ... are the possible energy-levels of the single particles 
and %, n 2> ... integers which indicate how often this level appears 
in the original sum E 1 +E a +...+E n (where each E k was attri- 
buted to one definite particle). 

The sum of these occupation numbers %, n 2 ,... 

n 1 +n 2 +... = J i n s — n (5) 

8 

may be given or it may not. The latter holds if particles Are 
absorbed or emitted, as in the case of photons. For a B.E. gas, 
including the case of photons, there is no restriction of the n 8i 
while for a FJD. gas each energy- value e a can only appear once, 
if it appears at all. Hence one has the two cases 

(B.E.) u 8 = 0, 1, 2, 3,... 

(F.D.) ti 8 = 0,1. 

Now we apply the general laws of statistical equilibrium, which 
have to be supplemented by the fundamental rule of quantum 
mechanics that each non-degenerate (simple) quantum state has 
the same weight. (This is implied by the equation (9.14) of the 
text which shows that the diagonal element of the density matrix 
determines the number of particles in the corresponding state.) 
As we have seen in Appendix, 14, it suffices to calculate the 
partition function Z (14.24), p. 158, with all a> 8 = 1, 

% — ^ e~P E(:ril,nt, "\ 

n v n v ... 


( 7 ) 
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where the sum. is to be extended over aJl quantum n um bers n a , 
which describe a definite state of the system. These are just the 
numbers introduced in (4), with or without the restriction (5) 
according to the type of particle. Introducing the abbreviation 

z t = e-fo (8) 

one has 

z - ff--!«P2*r~-nx«sf- <») 

mi*?**... fti na 8 

The sum is easily evaluated for the two cases (6), 


(B.E.) J**. = 1 +Z'+z*+2»+... = 


1-z/ 


(T.D.) Js?*=l+z s . 

One can conveniently combine the results into one expression 
Z = TT(1 =Fz / in \ 

’ /TP TT \ ( 10 ) 


wb J^ upper ^ ref ers throughout to the B.E. case. 

Itus formula contains the theory of radiation, where the 
? on . ^ l oes no ^ a pp!y- But it is more convenient to deal 
SlIS (5) MdS “ d *** to condition in 

at ‘*“ original form (9) of 2 shows that the oondition 

Kj ° f — ' Uoh - 

fo^^ , L“ o4ofoompbI “ teeiato - We 

f(0 = n(i=F6g« (11) 

powers of C- The coefficient of C n is obviouslv 

^oao^odnottSJwiththotestnotionit). hLowsoSS 

nr 1 P 


z = id$ 


(is) 

that no other o“ *** a W&3? 

only one ndnhnnm on ^ “ d 
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As in previous cases (see Appendix, 14, 15) the crudest 
approximation suffices. One writes the integrand in the form 

Wh6re 9(t) = — (»+l)log £+log /(£), 

and determines the minirruTm of the function g{£) from 

n + 1 . d 


g'(t) = ~^+^los /(£) = 0; 

then one has to calculate g(Q and 

»H-1 . d 2 


(13) 


*'(0 = 


£2 - d £2 


log m. 


for the value of £ which is the root of (13). To a first approxi- 
mation one finds 


log Z = — (n.+ l)log£+log/(£)— \\og{2'ng"(V)}. 

Neglecting 1 compared with n and the last term (which can 
be seen to be of a smaller order), one obtains 

log Z = wlog £+log/(£); (14) 

here £ is the root of (13), where also n + 1 can be replaced by n. 
Now one gets from (11) 

log/(£) = 2 Tlog(lT£z s ), 

8 

dlogfU) _V z, (16) 

di 4 1 = f&' 

Hence, from (8) and (13), 

2 e ^ T i =n { p = 1/kT.^ (16) 

From this equation a (or £) can be determined, as function of 
the particle number and of temperature. One easily sees now 
that the case where the number of partioles n is not given is 
obtained by just omitting the equation (16) and putting a = 0 
or £ = 1 . Yet the equation ( 1 6) is not entirely meaningless now, 
it gives the changing number of particles actually present. 
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The mean number of particles of the kind s is obviously 
- _ 183 Id log Z m 

* 2 ^Sj8e s j8 Se s 5 

hence, from (14) and (15), 

*-;s*=Fi' < 17 > 


which co nfirms (16) with (5). This formula, for the B.E. case 
(minus sign), has been mentioned in VTII (8.20), where it was 
obtained by a completely different consideration of Einstein’s. 
In the same way, the average energy of the system is found to 

in agreement 'with (4). 


These are the fundamental formulae of quantum gases, derived 
from the general kinetic theory. They are to be found in A. Ph. 
Ch. VH, p. 197; in particular the fundamental formula (17) in 
§5, p. 224, for B.E., and in § 6, p. 228, for F.D. All further 
developments may be read there (or in any other of the 
many books dealing with the subject). I wish to conclude t his 
presentation by giving the explicit formulae for monatomic 
gases, where the energy is e = p i /2m and the summation over 
cells is to be replaced by an integration over the momentum 
space. The weight of a cell is found, by a simple quantum- 
mechanical consideration, for a single particle without spin 
(A. Ph. Ch. VII, § 4, p. 215) to be 


Hence, introducing the integration variable 

x = p /JL = ft 

2m J(2mJcT)’ 
one obtains from (17) and (18) 


n = ^jI(2mkT) i l* f x * dx 

A 8 y J ga+a^ipi* 

0 

(19) 

V- *£(***)«. J_£ 

(20) 
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which are the quantum generalizations of the formulae given in 
Appendix 14 and reduce to them for oc -> oo. A detailed discus- 
sion would he outside the plan of this book. It need only he 
said that theF.D. statistics of electrons have been fully confirmed 
by the study of the properties of metals (A. Ph . Ch. VII. 7, 
p. 229; 8, p. 232; 9, p. 235; 10, p. 236; A. XXX, p. 352). 


33, (IX. p. 116.) Quantum equations of motion 
At the end of Chapter VI, which deals with the kinetic theory 
of (dense) matter from the classical standpoint, the statistical 
derivation of the phenomenological hydro-thermal equations 
was mentioned and reference made to this later Appendix, which 
belongs to quantum theory. This was done to save space; for 
the classical derivation is essentially the same as that based on 
quantum theory, and one easily obtains it from the latter by a 
few simple rules. 

The first of these rules is, of course, the correspondence of the 

normalized commutator [a, 81 = -i(aj8 — /Ja) with the Poisson 

u in 


bracket 


M] = 2(; 
1 ' 


/ doc 

doc dp \ 

\ax® sp w 

0p®0X®j : 


(i) 


if and are the position and momentum vectors on which 
a and /? depend. 

The second rule concerns the operator which in the text is 
described in words; expressed in mathematical symbols it is 


x a ... = JJ iiWxM' 

It has to be interpreted classically to mean 



Thus the classical operation J corresponds to 


( 2 ) 

( 3 ) 


i.e 

to _ 

unless a and j3 commute; in general one must replace by 

{“£} = J(« P+M- 

By applying these rules one can easily go over from quantum 


. to substituting for x^ as stated in the text. 

In using the correspondence principle to proceed from classical 
Quantum mechanics a product aS may not be left unchanged 
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kscs.“e=s-> ”~ 

& , the following ones are precisely similar. One has: 

B ’ fl "->+2Sl >W)! + (4) 

JT- '^f 8 4116 ° ler “ im 40 (!U,6, • 0,16 obti,to8 ' «*« 

dfty-i __ 




(6) 


ft-., PAr] , 

The middle term on the right-hand side is 

2mM Xr ^ 1?m - (V (N) P n+p n P (N) ) — (pW PA ,-(- Pn pcw>) .pWj 

i r a 
m J 

Se T S^?o tote « raI - 

pl r„it‘i^ssrr~ Z: 

^StaSnSwS molecnlM d “~£°rZ: 

quantities. The generahVeH ‘j. •*. . terms _ of the molecular 
ordinary numberdensity % for , reduc ^ to the 
of the positions *» y®lL JL- *’ ° btamed as a function 
in a* density mt£, ht-TS? - *° « - K 2,..., *) 
for a molecule (*) in tie dLt«t of o m a wT° S °u P ‘° ™ looit:! ' u “ 

given is the average value «f olec L ul6s whoBe positions are 

verage value of the quantity represented by the 

operator — p<9- 
m 

1 


u«> = 


’ (6) 
elements of the matrix. pt X ~ x tbe diagonal 
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By expressing (9.48) in the coordinate representation and 
■writing x®' = x®, one obtains the equation of continuity 


Bn. 


8t 


_ 


' d 
?0x® 




(7) 


1 d 
"m0x® 


since 

-(P^a+PaP^-P^-x 

•{p a P W) k'- x . 

l/t» VA’ r 

and 

= [{<D(|x®— x®[)— <£(|x®'— x®'|)}p(x, x')] x ,„ x = 0. 

Next, multiply (9.48) by the operator p® before and after, taking 
half the sum, and then write x®' = x® (j = 1, 2,...,q). 

d 

The left-hand side evidently reduces to m — (n a u®). One has 

ot 


further 


and 








{P ( Wi[^ + V a+ i]}x'-x = “ J», 

Hence, if a tensor l ( ® is defined by 

1 8 .!«« = », J 8 ®‘“ 


,1 

one has 


ax® 


y-i 


ax® 


■/ 


f 


'a ax® ’ 

g<J>(*a-i) 

>a+1 ax® 

0®«z+u 


dx<a+». 




0X® 


dxte+w. 


»!<»,<>+ % ^.[±WV>*-*+H= °- (8) 

j=l 

By using (7) one obtains 

!<»„»?>) = »*&- J 
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or, if djdt is the convective derivative — -U v ~ 

« ‘&B®* 

( 9 ) 

1-1 

Hence (8) may be written in the form 
dujp ^ s 

mn ‘St+ 2, sxfi ■<*” = ’>■ do) 

i- 1 

Wllere ptf« = k«C_(.|W 

kW = ~{P*(P <i) P«)} X '-x~ mw « VjpVfP (11) 

= m{vW(v<%)} x ._ x ; 

here v« = I pW_ u «> S(xW-xW') (12) 

is Hie relative molecular velocity referred to the visible motion. 

The equation (10) is the generalized equation of motion of the 
cluster ' of q molecules, which reduces to the ordinary equation of 
hydrodynamics when q = 1 . 

fT*’, generalized Pressure tensor, is seen to consist of two 
**** and ^ associated with the kinetic energy of motion 
and the potential energy between the molecules respectively. 

The diagonal element of the tensor kW is a multiple of the 
kinetic temperature defined by 

= £m{v®(v<%)} x ,„ x . (13) 

The equation of energy transfer can be obtained in the same way 
as the equation of motion by calculating the rate of change with 
tune of Tg\ 


34. (IX. p. 118.) Supraconductivity 

There exists a satisfactory phenomenological theory of supra- 
conduotivity, mainly due to F. London; it is excellently pre- 
sented in a book by M. von Laue, where the literature can be 
found. (Theorie der SuprcOeitung: Springer, Berlin u. Gottingen, 

Many attempts to formulate an electronic theory have 
been made, without much success. Recently W. Heisenberg 
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has published some papers (Z. f. Naturforschung, 2 a, p. 185, 
1947) which claim to explain the essential features of the pheno- 
menon. According to this theory every metal ought to be supra- 
conductive for sufficiently low temperatures. Actually the alkali 
metals which have one ‘free’ electron are not supraoonduotive 
even at the lowest temperatures at present obtainable, and it is 
not very likely that a further decrease of temperature will change 
this. There are also theoretical objections against Heisenberg’s 
method. 

A different theory has been developed by my collaborator 
Mr. Kai Chia Cheng and myself, which connects supracon- 
ductivity with certain properties of the crystal lattice and pre- 
dicts correlations between structure and supraoonduotive state, 
which are confirmed by the facts (e.g. the behaviour of the alkali 
metals). The complete theory will be worked out in due course. 


35. (X. p. 124.) Economy of thinking 

The ideal of simplicity has found a materialistic expression in 
Ernst Mach’s principle of economy in thought (Prinzip der 
Denk-Okonomie). He maintains that the purpose of theory in 
science is to economize our mental efforts. This formulation, 
often repeated by other authors, seems to me very objectionable. 
If we want to economize thinking the best way would be to stop 
thinking at all. A minimum principle like this has, as is well 
known to mathematicians, a meaning only if a constraining 
condition is added. We must first agree that we are confronted 
with the task not only of bringing some order into a vast expanse 
of accumulated experience but also of perpetually extending this 
experience by research; then we shall readily consent that we 
would be lost without the utmost efficiency and clarity in think- 
ing. To replace these words by the expression ‘economy of 
thinking’ may have an appeal to engineers and others interested 
in practical applications, but hardly to those who enjoy thinking 
for no other purpose than to clarify a problem. 


36. (X. p. 127.) Concluding remarks 

I feel that any critical reference to philosophical literature 
ought to be based on quotations. Yet, as I have said before, my 
reading of philosophical books is sporadic and unsystematic, 
and what I say here is a mere general impression. A book which 
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I have recently read with some oare is E. Cassirer’s Determinis- 
mvs und Indeterminismus in der modemen Physile (GOteborg, 
Elanders, 1937), which gives an excellent account of the situa- 
tion, not only in physics itself but also with regard to possible 
applications of the new physical ideas to other fields. There one 
finds references to and quotations from all great thinkers who have 
written about the problem. The last section contains Cassirer’s 


opinion on the ethical consequences of physical indeterminism 
which is essentially the same as that expressed by myself. I 
quote his words (translated from p. 259): ‘From the significance 
of freedom, as a mere possibility limited by natural laws, 
there is no way to that “reality” of volition and freedom of 
decision with which ethics is concerned. To mistake the choice 
(Auswahl) which an electron, according to Bohr’s theory has 
between different quantum orbits, with a choice (Wahl) in 
the ethical sense of this concept, would mean to become the 
victim of a purely linguistic equivocality. To speak of an ethical 
choice there must not only be different possibilities but a con- 
scious distinction between them and a conscious decision 
about them. To attribute such acts to an electron would be a 


gross relapse into a form of anthropomorphism ’ Concerning 

the inverse problem whether the ‘freedom’ of the electron helps 
us to understand the freedom of volition he says this (p. 261): 
‘It is of no avail whether causality in nature is regarded in 
the form of rigorous “dynamical” laws or of merely statistical 
laws. ... In neither way does there remain open an access to 
that sphere of “freedom” which is claimed by ethics’. 

My short survey of these difficult problems cannot be com- 
pared with Cassirer’s deep and thorough study. Yet it is a 
satisfaction to me that he also sees the philosophical importance 
of quantum theory not so much in the question of indeterminism 
but in the possibility of several complementary perspectives or 
aspects in the description of the same phenomena as soon as 
different standpoints of meaning are taken. There is no unique 
image of our whole world of experience. 


Appendix, added after delivering the lectures, gives 

TdS 6 t0 6XpreSS thanks t0 my 

^ t0 5 SCUSS P roblems and to Praise 
objections. One of these was directed against my expression 
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‘observational invariants’ ; it was said that the conception of 
invariant presupposes the existence of a group of transformations 
which is lacking in this case. I do not think that this is right. 
The problem is, of course, a psychological one; what I call 
‘observational invariants’ corresponds roughly to the Gestalten 
of the psychologists. The essence of Gestalten theory is that the 
primary perceptions consist not in uncoordinated sense im- 
pressions but in total shapes or configurations which preserve 
their identity independently of their own movements and the 
oVinnging standpoint of the observer. Now compare this with 
a mathematical example, say the definition of the group of 
rotations as those linear transformations of the coordinates 
x,y,z for which x t -\-y 2 +z 2 is invariant. The latter condition 
can' be interpreted geometrically as postulating the invariance 
of the shape of spheres. Hence the group is defined by assuming 
the existence of a definite invariant configuration or Gestalt, not 
the other way round. The situation in psychology seems to me 
quite analogous, though much less precise. Yet I think that this 
analogy is of some help in understanding what we mean by real 
tJnngfl in the flow of perceptions. 

Another objection was raised against my use of the expression 
‘metaphysical’ because of its association with speculative sys- 
tems of philosophy . I need hardly say that I do not like this kind 
of metaphysics, which pretends that there is a definite goal to be 
reached and often claims to have reached it. I am convinced 
that we are on a never-ending way; on a good and enjoyable way, 
but far from any goal. Metaphysical systematization means 
formalization and petrification. Yet there are metaphysical 
problems, which cannot be disposed of by declaring them 
meaningless, or by calling them with other names, like epistemo- 
logy. For, as I have repeatedly said, they are ‘beyond physics’ 
indeed and demand an act of faith. We have to accept this fact 
to be honest. There are two objectionable types of believers: 
those who believe the incredible and those who believe that 
‘belief’ mustbe discarded andreplacedby ‘the scientific method . 

Between these two extremes on the right and the left there is 
enough scope for believing the reasonable and reasoning on 
sound beliefe. Faith, imagination, and intuition are decisive 
factors in the progress of science as in any other human activity. 

63 SI 
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